Creative camera

ABSTRACT

The present disclosure generally relates to displaying visual effects in image data. In some examples, visual effects include an avatar displayed on a user&#39;s face. In some examples, visual effects include stickers applied to image data. In some examples, visual effects include screen effects. In some examples, visual effects are modified based on depth data in the image data.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.16/143,097, entitled “Creative Camera,” filed Sep. 26, 2018, whichclaims the benefit of: U.S. Provisional Application No. 62/668,227,entitled “Creative Camera,” filed May 7, 2018; and U.S. ProvisionalApplication No. 62/679,934, entitled “Creative Camera,” filed Jun. 3,2018, the contents of which are hereby incorporated by reference intheir entirety.

FIELD

The present disclosure relates generally to computer user interfaces,and more specifically to techniques for displaying visual effects.

BACKGROUND

Visual effects are used to enhance a user's experience when capturingand viewing media using electronic devices. Visual effects can alter theappearance of image data or can represent an idealized or completelyfictional representation of an environment captured in an image.

BRIEF SUMMARY

Some techniques for displaying visual effects using electronic devices,however, are generally cumbersome and inefficient. For example, someexisting techniques use a complex and time-consuming user interface,which may include multiple key presses or keystrokes. Existingtechniques require more time than necessary, wasting user time anddevice energy. This latter consideration is particularly important inbattery-operated devices.

Accordingly, the present technique provides electronic devices withfaster, more efficient methods and interfaces for displaying visualeffects. Such methods and interfaces optionally complement or replaceother methods for displaying visual effects. Such methods and interfacesreduce the cognitive burden on a user and produce a more efficienthuman-machine interface. For battery-operated computing devices, suchmethods and interfaces conserve power and increase the time betweenbattery charges.

A method is described. The method is performed at an electronic devicehaving a camera, a display apparatus, and one or more input devices. Themethod comprises: displaying, via the display apparatus, a messaginguser interface of a message conversation including at least a firstparticipant, the messaging user interface including a camera affordance;detecting, via the one or more input devices, a first input directed tothe camera affordance; in response to detecting the first input,displaying a camera user interface, the camera user interface includinga capture affordance; detecting, via the one or more input devices, asecond input directed to the capture affordance; in response todetecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance at a location in the camera user interface that waspreviously occupied by the capture affordance; detecting, via the one ormore input devices, a third input directed to the send affordance; andin response to detecting the third input, initiating a process to sendthe captured image data to the first participant.

A non-transitory computer-readable storage medium is described. Thenon-transitory computer-readable storage medium stores one or moreprograms configured to be executed by one or more processors of anelectronic device with a camera, a display apparatus, and one or moreinput devices, the one or more programs including instructions for:displaying, via the display apparatus, a messaging user interface of amessage conversation including at least a first participant, themessaging user interface including a camera affordance; detecting, viathe one or more input devices, a first input directed to the cameraaffordance; in response to detecting the first input, displaying acamera user interface, the camera user interface including a captureaffordance; detecting, via the one or more input devices, a second inputdirected to the capture affordance; in response to detecting the secondinput: capturing image data using the camera; ceasing to display thecapture affordance; and displaying a send affordance at a location inthe camera user interface that was previously occupied by the captureaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; and in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant.

A transitory computer-readable storage medium is described. Thetransitory computer-readable storage medium stores one or more programsconfigured to be executed by one or more processors of an electronicdevice with a camera, a display apparatus, and one or more inputdevices, the one or more programs including instructions for:displaying, via the display apparatus, a messaging user interface of amessage conversation including at least a first participant, themessaging user interface including a camera affordance; detecting, viathe one or more input devices, a first input directed to the cameraaffordance; in response to detecting the first input, displaying acamera user interface, the camera user interface including a captureaffordance; detecting, via the one or more input devices, a second inputdirected to the capture affordance; in response to detecting the secondinput: capturing image data using the camera; ceasing to display thecapture affordance; and displaying a send affordance at a location inthe camera user interface that was previously occupied by the captureaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; and in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant.

An electronic device is described. The electronic device comprises: acamera; a display apparatus; one or more input devices; one or moreprocessors; and memory storing one or more programs configured to beexecuted by the one or more processors, the one or more programsincluding instructions for: displaying, via the display apparatus, amessaging user interface of a message conversation including at least afirst participant, the messaging user interface including a cameraaffordance; detecting, via the one or more input devices, a first inputdirected to the camera affordance; in response to detecting the firstinput, displaying a camera user interface, the camera user interfaceincluding a capture affordance; detecting, via the one or more inputdevices, a second input directed to the capture affordance; in responseto detecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance at a location in the camera user interface that waspreviously occupied by the capture affordance; detecting, via the one ormore input devices, a third input directed to the send affordance; andin response to detecting the third input, initiating a process to sendthe captured image data to the first participant.

An electronic device is described. The electronic device comprises: acamera; a display apparatus; one or more input devices; means fordisplaying, via the display apparatus, a messaging user interface of amessage conversation including at least a first participant, themessaging user interface including a camera affordance; means fordetecting, via the one or more input devices, a first input directed tothe camera affordance; means, responsive to detecting the first input,for displaying a camera user interface, the camera user interfaceincluding a capture affordance; means for detecting, via the one or moreinput devices, a second input directed to the capture affordance; means,responsive to detecting the second input, for: capturing image datausing the camera; ceasing to display the capture affordance; anddisplaying a send affordance at a location in the camera user interfacethat was previously occupied by the capture affordance; means fordetecting, via the one or more input devices, a third input directed tothe send affordance; and means, responsive to detecting the third input,for initiating a process to send the captured image data to the firstparticipant.

A method is described. The method is performed at an electronic devicehaving a camera and a display apparatus. The method comprises:displaying, via the display apparatus, a camera user interface, thecamera user interface including: a camera display region including arepresentation of image data captured via the camera; and a firstaffordance associated with a first camera display mode; while a subjectis positioned within a field of view of the camera and a representationof the subject and a background are displayed in the camera displayregion, detecting a gesture directed to the first affordance; inresponse to detecting the gesture directed to the first affordance,activating the first camera display mode, wherein activating the firstcamera display mode includes: displaying an avatar selection regionincluding a selected one of a plurality of avatar options; anddisplaying a representation of the selected avatar option on therepresentation of the subject in the camera display region; while thefirst camera display mode is active, detecting a change in pose of thesubject; and in response to detecting the change in pose of the subject,changing an appearance of the displayed representation of the selectedavatar option based on the detected change in pose of the subject whilemaintaining display of the background.

A non-transitory computer-readable storage medium is described. Thenon-transitory computer-readable storage medium stores one or moreprograms configured to be executed by one or more processors of anelectronic device with a camera and a display apparatus, the one or moreprograms including instructions for: displaying, via the displayapparatus, a camera user interface, the camera user interface including:a camera display region including a representation of image datacaptured via the camera; and a first affordance associated with a firstcamera display mode; while a subject is positioned within a field ofview of the camera and a representation of the subject and a backgroundare displayed in the camera display region, detecting a gesture directedto the first affordance; in response to detecting the gesture directedto the first affordance, activating the first camera display mode,wherein activating the first camera display mode includes: displaying anavatar selection region including a selected one of a plurality ofavatar options; and displaying a representation of the selected avataroption on the representation of the subject in the camera displayregion; while the first camera display mode is active, detecting achange in pose of the subject; and in response to detecting the changein pose of the subject, changing an appearance of the displayedrepresentation of the selected avatar option based on the detectedchange in pose of the subject while maintaining display of thebackground.

A transitory computer-readable storage medium is described. Thetransitory computer-readable storage medium stores one or more programsconfigured to be executed by one or more processors of an electronicdevice with a camera and a display apparatus, the one or more programsincluding instructions for: displaying, via the display apparatus, acamera user interface, the camera user interface including: a cameradisplay region including a representation of image data captured via thecamera; and a first affordance associated with a first camera displaymode; while a subject is positioned within a field of view of the cameraand a representation of the subject and a background are displayed inthe camera display region, detecting a gesture directed to the firstaffordance; in response to detecting the gesture directed to the firstaffordance, activating the first camera display mode, wherein activatingthe first camera display mode includes: displaying an avatar selectionregion including a selected one of a plurality of avatar options; anddisplaying a representation of the selected avatar option on therepresentation of the subject in the camera display region; while thefirst camera display mode is active, detecting a change in pose of thesubject; and in response to detecting the change in pose of the subject,changing an appearance of the displayed representation of the selectedavatar option based on the detected change in pose of the subject whilemaintaining display of the background.

An electronic device is described. The electronic device comprises: acamera; a display apparatus; one or more processors; and memory storingone or more programs configured to be executed by the one or moreprocessors, the one or more programs including instructions for:displaying, via the display apparatus, a camera user interface, thecamera user interface including: a camera display region including arepresentation of image data captured via the camera; and a firstaffordance associated with a first camera display mode; while a subjectis positioned within a field of view of the camera and a representationof the subject and a background are displayed in the camera displayregion, detecting a gesture directed to the first affordance; inresponse to detecting the gesture directed to the first affordance,activating the first camera display mode, wherein activating the firstcamera display mode includes: displaying an avatar selection regionincluding a selected one of a plurality of avatar options; anddisplaying a representation of the selected avatar option on therepresentation of the subject in the camera display region; while thefirst camera display mode is active, detecting a change in pose of thesubject; and in response to detecting the change in pose of the subject,changing an appearance of the displayed representation of the selectedavatar option based on the detected change in pose of the subject whilemaintaining display of the background.

An electronic device is described. The electronic device comprises: acamera; a display apparatus; one or more input devices; means fordisplaying, via the display apparatus, a camera user interface, thecamera user interface including: a camera display region including arepresentation of image data captured via the camera; and a firstaffordance associated with a first camera display mode; means, while asubject is positioned within a field of view of the camera and arepresentation of the subject and a background are displayed in thecamera display region, for detecting a gesture directed to the firstaffordance; means, responsive to detecting the gesture directed to thefirst affordance, for activating the first camera display mode, whereinactivating the first camera display mode includes: displaying an avatarselection region including a selected one of a plurality of avataroptions; and displaying a representation of the selected avatar optionon the representation of the subject in the camera display region;means, while the first camera display mode is active, for detecting achange in pose of the subject; and means, responsive to detecting thechange in pose of the subject, for changing an appearance of thedisplayed representation of the selected avatar option based on thedetected change in pose of the subject while maintaining display of thebackground.

A method is described. The method is performed at an electronic devicehaving a display apparatus. The method comprises: displaying, via thedisplay apparatus, a media user interface, the media user interfaceincluding: a media display region including a representation of a mediaitem; and an effects affordance; detecting a gesture directed to theeffects affordance; in response to detecting the gesture directed to theeffects affordance, displaying a plurality of effects options forapplying effects to the media item concurrently with a representation ofthe media item, including: in accordance with a determination that themedia item is associated with corresponding depth data, the plurality ofeffects options include a respective effects option for applying effectsbased on the depth data; and in accordance with a determination that theimage data does not include the depth data, the respective effectsoption is not available for activation in the plurality of effectsoptions.

A non-transitory computer-readable storage medium is described. Thenon-transitory computer-readable storage medium stores one or moreprograms configured to be executed by one or more processors of anelectronic device with a display apparatus, the one or more programsincluding instructions for: displaying, via the display apparatus, amedia user interface, the media user interface including: a mediadisplay region including a representation of a media item; and aneffects affordance; detecting a gesture directed to the effectsaffordance; in response to detecting the gesture directed to the effectsaffordance, displaying a plurality of effects options for applyingeffects to the media item concurrently with a representation of themedia item, including: in accordance with a determination that the mediaitem is associated with corresponding depth data, the plurality ofeffects options include a respective effects option for applying effectsbased on the depth data; and in accordance with a determination that theimage data does not include the depth data, the respective effectsoption is not available for activation in the plurality of effectsoptions.

A transitory computer-readable storage medium is described. Thetransitory computer-readable storage medium stores one or more programsconfigured to be executed by one or more processors of an electronicdevice with a display apparatus, the one or more programs includinginstructions for: displaying, via the display apparatus, a media userinterface, the media user interface including: a media display regionincluding a representation of a media item; and an effects affordance;detecting a gesture directed to the effects affordance; in response todetecting the gesture directed to the effects affordance, displaying aplurality of effects options for applying effects to the media itemconcurrently with a representation of the media item, including: inaccordance with a determination that the media item is associated withcorresponding depth data, the plurality of effects options include arespective effects option for applying effects based on the depth data;and in accordance with a determination that the image data does notinclude the depth data, the respective effects option is not availablefor activation in the plurality of effects options.

An electronic device is described. The electronic device comprises: adisplay apparatus; one or more processors; and memory storing one ormore programs configured to be executed by the one or more processors,the one or more programs including instructions for: displaying, via thedisplay apparatus, a media user interface, the media user interfaceincluding: a media display region including a representation of a mediaitem; and an effects affordance; detecting a gesture directed to theeffects affordance; in response to detecting the gesture directed to theeffects affordance, displaying a plurality of effects options forapplying effects to the media item concurrently with a representation ofthe media item, including: in accordance with a determination that themedia item is associated with corresponding depth data, the plurality ofeffects options include a respective effects option for applying effectsbased on the depth data; and in accordance with a determination that theimage data does not include the depth data, the respective effectsoption is not available for activation in the plurality of effectsoptions.

An electronic device is described. The electronic device comprises: adisplay apparatus; one or more input devices; means for displaying, viathe display apparatus, a media user interface, the media user interfaceincluding: a media display region including a representation of a mediaitem; and an effects affordance; means for detecting a gesture directedto the effects affordance; means, responsive to detecting the gesturedirected to the effects affordance, for displaying a plurality ofeffects options for applying effects to the media item concurrently witha representation of the media item, including: in accordance with adetermination that the media item is associated with corresponding depthdata, the plurality of effects options include a respective effectsoption for applying effects based on the depth data; and in accordancewith a determination that the image data does not include the depthdata, the respective effects option is not available for activation inthe plurality of effects options.

A method is described. The method is performed at an electronic devicehaving a display apparatus. The method comprises: displaying, via thedisplay apparatus, a live video communication user interface of a livevideo communication application, the live video communication userinterface including: a representation of a subject participating in alive video communication session, and a first affordance; detecting agesture directed to the first affordance; and in response to detectingthe gesture directed to the first affordance: activating a cameraeffects mode; and increasing a size of the representation of the subjectparticipating in the live video communication session.

A non-transitory computer-readable storage medium is described. Thenon-transitory computer-readable storage medium stores one or moreprograms configured to be executed by one or more processors of anelectronic device with a display apparatus, the one or more programsincluding instructions for: displaying, via the display apparatus, alive video communication user interface of a live video communicationapplication, the live video communication user interface including: arepresentation of a subject participating in a live video communicationsession, and a first affordance; detecting a gesture directed to thefirst affordance; and in response to detecting the gesture directed tothe first affordance: activating a camera effects mode; and increasing asize of the representation of the subject participating in the livevideo communication session.

A transitory computer-readable storage medium is described. Thetransitory computer-readable storage medium stores one or more programsconfigured to be executed by one or more processors of an electronicdevice with a display apparatus, the one or more programs includinginstructions for: displaying, via the display apparatus, a live videocommunication user interface of a live video communication application,the live video communication user interface including: a representationof a subject participating in a live video communication session, and afirst affordance; detecting a gesture directed to the first affordance;and in response to detecting the gesture directed to the firstaffordance: activating a camera effects mode; and increasing a size ofthe representation of the subject participating in the live videocommunication session.

An electronic device is described. The electronic device comprises: adisplay apparatus; one or more processors; and memory storing one ormore programs configured to be executed by the one or more processors,the one or more programs including instructions for: displaying, via thedisplay apparatus, a live video communication user interface of a livevideo communication application, the live video communication userinterface including: a representation of a subject participating in alive video communication session, and a first affordance; detecting agesture directed to the first affordance; and in response to detectingthe gesture directed to the first affordance: activating a cameraeffects mode; and increasing a size of the representation of the subjectparticipating in the live video communication session.

An electronic device is described. The electronic device comprises: adisplay apparatus; one or more input devices; means for displaying, viathe display apparatus, a live video communication user interface of alive video communication application, the live video communication userinterface including: a representation of a subject participating in alive video communication session, and a first affordance; means fordetecting a gesture directed to the first affordance; and means,responsive to detecting the gesture directed to the first affordance,for: activating a camera effects mode; and increasing a size of therepresentation of the subject participating in the live videocommunication session.

A method is described. The method is performed at an electronic devicehaving a camera and a display apparatus. The method comprises:displaying, via the display apparatus, a representation of image datacaptured via the one or more cameras, wherein the representationincludes a representation of a subject and the image data corresponds todepth data that includes depth data for the subject; displaying, via thedisplay apparatus, a representation of a virtual avatar that isdisplayed in place of at least a portion of the representation of thesubject, wherein the virtual avatar is placed at simulated depthrelative to the representation of the subject as determined based on thedepth data for the subject, displaying the representation of the virtualavatar includes: in accordance with a determination, based on the depthdata, that a first portion of the virtual avatar satisfies a set ofdepth-based display criteria, wherein the depth-based display criteriainclude a requirement that the depth data for the subject indicate thatthe first portion of the virtual avatar has a simulated depth that is infront of a corresponding first portion of the subject, in order for thedepth-based display criteria to be met, including as part of therepresentation of the virtual avatar, a representation of the firstportion of the virtual avatar that is displayed in place of the firstportion of the subject; and in accordance with a determination, based onthe depth data, that the first portion of the virtual avatar does notsatisfy the set of depth-based display criteria for the first portion ofthe subject, excluding, from the representation of the virtual avatar,the representation of the first portion of the virtual avatar anddisplaying the first portion of the subject in the region that wouldhave been occupied by the first portion of the virtual avatar.

A non-transitory computer-readable storage medium is described. Thenon-transitory computer-readable storage medium stores one or moreprograms configured to be executed by one or more processors of anelectronic device with a camera and a display apparatus, the one or moreprograms including instructions for: displaying, via the displayapparatus, a representation of image data captured via the one or morecameras, wherein the representation includes a representation of asubject and the image data corresponds to depth data that includes depthdata for the subject; displaying, via the display apparatus, arepresentation of a virtual avatar that is displayed in place of atleast a portion of the representation of the subject, wherein thevirtual avatar is placed at simulated depth relative to therepresentation of the subject as determined based on the depth data forthe subject, displaying the representation of the virtual avatarincludes: in accordance with a determination, based on the depth data,that a first portion of the virtual avatar satisfies a set ofdepth-based display criteria, wherein the depth-based display criteriainclude a requirement that the depth data for the subject indicate thatthe first portion of the virtual avatar has a simulated depth that is infront of a corresponding first portion of the subject, in order for thedepth-based display criteria to be met, including as part of therepresentation of the virtual avatar, a representation of the firstportion of the virtual avatar that is displayed in place of the firstportion of the subject; and in accordance with a determination, based onthe depth data, that the first portion of the virtual avatar does notsatisfy the set of depth-based display criteria for the first portion ofthe subject, excluding, from the representation of the virtual avatar,the representation of the first portion of the virtual avatar anddisplaying the first portion of the subject in the region that wouldhave been occupied by the first portion of the virtual avatar.

A transitory computer-readable storage medium is described. Thetransitory computer-readable storage medium stores one or more programsconfigured to be executed by one or more processors of an electronicdevice with a camera and a display apparatus, the one or more programsincluding instructions for: displaying, via the display apparatus, arepresentation of image data captured via the one or more cameras,wherein the representation includes a representation of a subject andthe image data corresponds to depth data that includes depth data forthe subject; displaying, via the display apparatus, a representation ofa virtual avatar that is displayed in place of at least a portion of therepresentation of the subject, wherein the virtual avatar is placed atsimulated depth relative to the representation of the subject asdetermined based on the depth data for the subject, displaying therepresentation of the virtual avatar includes: in accordance with adetermination, based on the depth data, that a first portion of thevirtual avatar satisfies a set of depth-based display criteria, whereinthe depth-based display criteria include a requirement that the depthdata for the subject indicate that the first portion of the virtualavatar has a simulated depth that is in front of a corresponding firstportion of the subject, in order for the depth-based display criteria tobe met, including as part of the representation of the virtual avatar, arepresentation of the first portion of the virtual avatar that isdisplayed in place of the first portion of the subject; and inaccordance with a determination, based on the depth data, that the firstportion of the virtual avatar does not satisfy the set of depth-baseddisplay criteria for the first portion of the subject, excluding, fromthe representation of the virtual avatar, the representation of thefirst portion of the virtual avatar and displaying the first portion ofthe subject in the region that would have been occupied by the firstportion of the virtual avatar.

An electronic device is described. The electronic device comprises: acamera; a display apparatus; one or more processors; and memory storingone or more programs configured to be executed by the one or moreprocessors, the one or more programs including instructions for:displaying, via the display apparatus, a representation of image datacaptured via the one or more cameras, wherein the representationincludes a representation of a subject and the image data corresponds todepth data that includes depth data for the subject; displaying, via thedisplay apparatus, a representation of a virtual avatar that isdisplayed in place of at least a portion of the representation of thesubject, wherein the virtual avatar is placed at simulated depthrelative to the representation of the subject as determined based on thedepth data for the subject, displaying the representation of the virtualavatar includes: in accordance with a determination, based on the depthdata, that a first portion of the virtual avatar satisfies a set ofdepth-based display criteria, wherein the depth-based display criteriainclude a requirement that the depth data for the subject indicate thatthe first portion of the virtual avatar has a simulated depth that is infront of a corresponding first portion of the subject, in order for thedepth-based display criteria to be met, including as part of therepresentation of the virtual avatar, a representation of the firstportion of the virtual avatar that is displayed in place of the firstportion of the subject; and in accordance with a determination, based onthe depth data, that the first portion of the virtual avatar does notsatisfy the set of depth-based display criteria for the first portion ofthe subject, excluding, from the representation of the virtual avatar,the representation of the first portion of the virtual avatar anddisplaying the first portion of the subject in the region that wouldhave been occupied by the first portion of the virtual avatar.

An electronic device is described. The electronic device comprises: acamera; and a display apparatus; means for displaying, via the displayapparatus, a representation of image data captured via the one or morecameras, wherein the representation includes a representation of asubject and the image data corresponds to depth data that includes depthdata for the subject; means for displaying, via the display apparatus, arepresentation of a virtual avatar that is displayed in place of atleast a portion of the representation of the subject, wherein thevirtual avatar is placed at simulated depth relative to therepresentation of the subject as determined based on the depth data forthe subject, displaying the representation of the virtual avatarincludes: means in accordance with a determination, based on the depthdata, that a first portion of the virtual avatar satisfies a set ofdepth-based display criteria, wherein the depth-based display criteriainclude a requirement that the depth data for the subject indicate thatthe first portion of the virtual avatar has a simulated depth that is infront of a corresponding first portion of the subject, in order for thedepth-based display criteria to be met, for including as part of therepresentation of the virtual avatar, a representation of the firstportion of the virtual avatar that is displayed in place of the firstportion of the subject; and means in accordance with a determination,based on the depth data, that the first portion of the virtual avatardoes not satisfy the set of depth-based display criteria for the firstportion of the subject, for excluding, from the representation of thevirtual avatar, the representation of the first portion of the virtualavatar and displaying the first portion of the subject in the regionthat would have been occupied by the first portion of the virtualavatar.

Executable instructions for performing these functions are, optionally,included in a non-transitory computer-readable storage medium or othercomputer program product configured for execution by one or moreprocessors. Executable instructions for performing these functions are,optionally, included in a transitory computer-readable storage medium orother computer program product configured for execution by one or moreprocessors.

Thus, devices are provided with faster, more efficient methods andinterfaces for displaying visual effects, thereby increasing theeffectiveness, efficiency, and user satisfaction with such devices. Suchmethods and interfaces may complement or replace other methods fordisplaying visual effects.

DESCRIPTION OF THE FIGURES

For a better understanding of the various described embodiments,reference should be made to the Description of Embodiments below, inconjunction with the following drawings in which like reference numeralsrefer to corresponding parts throughout the figures.

FIG. 1A is a block diagram illustrating a portable multifunction devicewith a touch-sensitive display in accordance with some embodiments.

FIG. 1B is a block diagram illustrating exemplary components for eventhandling in accordance with some embodiments.

FIG. 2 illustrates a portable multifunction device having a touch screenin accordance with some embodiments.

FIG. 3 is a block diagram of an exemplary multifunction device with adisplay and a touch-sensitive surface in accordance with someembodiments.

FIG. 4A illustrates an exemplary user interface for a menu ofapplications on a portable multifunction device in accordance with someembodiments.

FIG. 4B illustrates an exemplary user interface for a multifunctiondevice with a touch-sensitive surface that is separate from the displayin accordance with some embodiments.

FIG. 5A illustrates a personal electronic device in accordance with someembodiments.

FIG. 5B is a block diagram illustrating a personal electronic device inaccordance with some embodiments.

FIGS. 6A-6BQ illustrate exemplary user interfaces for displaying visualeffects in a messaging application.

FIGS. 7A and 7B are a flow diagram illustrating a method for displayingvisual effects in a messaging application.

FIGS. 8A-8BQ illustrate exemplary user interfaces for displaying visualeffects in a camera application.

FIGS. 9A and 9B are a flow diagram illustrating a method for displayingvisual effects in a camera application.

FIGS. 10A-10AL illustrate exemplary user interfaces for displayingvisual effects in a media item viewing mode.

FIGS. 11A and 11B are a flow diagram illustrating a method fordisplaying visual effects in a media item viewing mode.

FIGS. 12A-12AP illustrate exemplary user interfaces for displayingvisual effects in a live video communication session.

FIGS. 13A and 13B are a flow diagram illustrating a method fordisplaying visual effects in a live video communication session.

FIGS. 14A-14M illustrate exemplary user interfaces for displaying visualeffects in a camera application.

FIGS. 15A and 15B are a flow diagram illustrating a method fordisplaying visual effects in a camera application.

DESCRIPTION OF EMBODIMENTS

The following description sets forth exemplary methods, parameters, andthe like. It should be recognized, however, that such description is notintended as a limitation on the scope of the present disclosure but isinstead provided as a description of exemplary embodiments.

There is a need for electronic devices that provide efficient methodsand interfaces for displaying visual effects. For example, whileprograms already exist for displaying visual effects, these programs areinefficient and difficult to use compared to the techniques below, whichallow a user to displaying visual effects in various applications. Suchtechniques can reduce the cognitive burden on a user who displays visualeffects in an application, thereby enhancing productivity. Further, suchtechniques can reduce processor and battery power otherwise wasted onredundant user inputs.

Below, FIGS. 1A-1B, 2, 3, 4A-4B, and 5A-5B provide a description ofexemplary devices for performing the techniques for managing eventnotifications.

FIGS. 6A-6BQ illustrate exemplary user interfaces for displaying visualeffects in a messaging application. FIGS. 7A and 7B are a flow diagramillustrating methods of displaying visual effects in a messagingapplication in accordance with some embodiments. The user interfaces inFIGS. 6A-6BQ are used to illustrate the processes described below,including the processes in FIGS. 7A and 7B.

FIGS. 8A-8BQ illustrate exemplary user interfaces for displaying visualeffects in a camera application. FIGS. 9A and 9B are a flow diagramillustrating methods of displaying visual effects in a cameraapplication in accordance with some embodiments. The user interfaces inFIGS. 8A-8BQ are used to illustrate the processes described below,including the processes in FIGS. 9A and 9B.

FIGS. 10A-10AL illustrate exemplary user interfaces for displayingvisual effects in a camera application. FIGS. 11A and 11B are a flowdiagram illustrating methods of displaying visual effects in a mediaitem viewing mode in accordance with some embodiments. The userinterfaces in FIGS. 10A-10AL are used to illustrate the processesdescribed below, including the processes in FIGS. 11A and 11B.

FIGS. 12A-12AP illustrate exemplary user interfaces for displayingvisual effects in a live video communication session. FIGS. 13A and 13Bare a flow diagram illustrating methods of displaying visual effects ina live video communication session in accordance with some embodiments.The user interfaces in FIGS. 12A-12AP are used to illustrate theprocesses described below, including the processes in FIGS. 13A and 13B.

FIGS. 14A-14M illustrate exemplary user interfaces for displaying visualeffects in a camera application. FIGS. 15A and 15B are a flow diagramillustrating methods of displaying visual effects in a cameraapplication in accordance with some embodiments. The user interfaces inFIGS. 14A-14M are used to illustrate the processes described below,including the processes in FIGS. 15A and 15B.

Although the following description uses terms “first,” “second,” etc. todescribe various elements, these elements should not be limited by theterms. These terms are only used to distinguish one element fromanother. For example, a first touch could be termed a second touch, and,similarly, a second touch could be termed a first touch, withoutdeparting from the scope of the various described embodiments. The firsttouch and the second touch are both touches, but they are not the sametouch.

The terminology used in the description of the various describedembodiments herein is for the purpose of describing particularembodiments only and is not intended to be limiting. As used in thedescription of the various described embodiments and the appendedclaims, the singular forms “a,” “an,” and “the” are intended to includethe plural forms as well, unless the context clearly indicatesotherwise. It will also be understood that the term “and/or” as usedherein refers to and encompasses any and all possible combinations ofone or more of the associated listed items. It will be furtherunderstood that the terms “includes,” “including,” “comprises,” and/or“comprising,” when used in this specification, specify the presence ofstated features, integers, steps, operations, elements, and/orcomponents, but do not preclude the presence or addition of one or moreother features, integers, steps, operations, elements, components,and/or groups thereof.

The term “if” is, optionally, construed to mean “when” or “upon” or “inresponse to determining” or “in response to detecting,” depending on thecontext. Similarly, the phrase “if it is determined” or “if [a statedcondition or event] is detected” is, optionally, construed to mean “upondetermining” or “in response to determining” or “upon detecting [thestated condition or event]” or “in response to detecting [the statedcondition or event],” depending on the context.

Embodiments of electronic devices, user interfaces for such devices, andassociated processes for using such devices are described. In someembodiments, the device is a portable communications device, such as amobile telephone, that also contains other functions, such as PDA and/ormusic player functions. Exemplary embodiments of portable multifunctiondevices include, without limitation, the iPhone®, iPod Touch®, and iPad®devices from Apple Inc. of Cupertino, Calif. Other portable electronicdevices, such as laptops or tablet computers with touch-sensitivesurfaces (e.g., touch screen displays and/or touchpads), are,optionally, used. It should also be understood that, in someembodiments, the device is not a portable communications device, but isa desktop computer with a touch-sensitive surface (e.g., a touch screendisplay and/or a touchpad).

In the discussion that follows, an electronic device that includes adisplay and a touch-sensitive surface is described. It should beunderstood, however, that the electronic device optionally includes oneor more other physical user-interface devices, such as a physicalkeyboard, a mouse, and/or a joystick.

The device typically supports a variety of applications, such as one ormore of the following: a drawing application, a presentationapplication, a word processing application, a website creationapplication, a disk authoring application, a spreadsheet application, agaming application, a telephone application, a video conferencingapplication, an e-mail application, an instant messaging application, aworkout support application, a photo management application, a digitalcamera application, a digital video camera application, a web browsingapplication, a digital music player application, and/or a digital videoplayer application.

The various applications that are executed on the device optionally useat least one common physical user-interface device, such as thetouch-sensitive surface. One or more functions of the touch-sensitivesurface as well as corresponding information displayed on the deviceare, optionally, adjusted and/or varied from one application to the nextand/or within a respective application. In this way, a common physicalarchitecture (such as the touch-sensitive surface) of the deviceoptionally supports the variety of applications with user interfacesthat are intuitive and transparent to the user.

Attention is now directed toward embodiments of portable devices withtouch-sensitive displays. FIG. 1A is a block diagram illustratingportable multifunction device 100 with touch-sensitive display system112 in accordance with some embodiments. Touch-sensitive display 112 issometimes called a “touch screen” for convenience and is sometimes knownas or called a “touch-sensitive display system.” Device 100 includesmemory 102 (which optionally includes one or more computer-readablestorage mediums), memory controller 122, one or more processing units(CPUs) 120, peripherals interface 118, RF circuitry 108, audio circuitry110, speaker 111, microphone 113, input/output (I/O) subsystem 106,other input control devices 116, and external port 124. Device 100optionally includes one or more optical sensors 164. Device 100optionally includes one or more contact intensity sensors 165 fordetecting intensity of contacts on device 100 (e.g., a touch-sensitivesurface such as touch-sensitive display system 112 of device 100).Device 100 optionally includes one or more tactile output generators 167for generating tactile outputs on device 100 (e.g., generating tactileoutputs on a touch-sensitive surface such as touch-sensitive displaysystem 112 of device 100 or touchpad 355 of device 300). Thesecomponents optionally communicate over one or more communication busesor signal lines 103.

As used in the specification and claims, the term “intensity” of acontact on a touch-sensitive surface refers to the force or pressure(force per unit area) of a contact (e.g., a finger contact) on thetouch-sensitive surface, or to a substitute (proxy) for the force orpressure of a contact on the touch-sensitive surface. The intensity of acontact has a range of values that includes at least four distinctvalues and more typically includes hundreds of distinct values (e.g., atleast 256). Intensity of a contact is, optionally, determined (ormeasured) using various approaches and various sensors or combinationsof sensors. For example, one or more force sensors underneath oradjacent to the touch-sensitive surface are, optionally, used to measureforce at various points on the touch-sensitive surface. In someimplementations, force measurements from multiple force sensors arecombined (e.g., a weighted average) to determine an estimated force of acontact. Similarly, a pressure-sensitive tip of a stylus is, optionally,used to determine a pressure of the stylus on the touch-sensitivesurface. Alternatively, the size of the contact area detected on thetouch-sensitive surface and/or changes thereto, the capacitance of thetouch-sensitive surface proximate to the contact and/or changes thereto,and/or the resistance of the touch-sensitive surface proximate to thecontact and/or changes thereto are, optionally, used as a substitute forthe force or pressure of the contact on the touch-sensitive surface. Insome implementations, the substitute measurements for contact force orpressure are used directly to determine whether an intensity thresholdhas been exceeded (e.g., the intensity threshold is described in unitscorresponding to the substitute measurements). In some implementations,the substitute measurements for contact force or pressure are convertedto an estimated force or pressure, and the estimated force or pressureis used to determine whether an intensity threshold has been exceeded(e.g., the intensity threshold is a pressure threshold measured in unitsof pressure). Using the intensity of a contact as an attribute of a userinput allows for user access to additional device functionality that mayotherwise not be accessible by the user on a reduced-size device withlimited real estate for displaying affordances (e.g., on atouch-sensitive display) and/or receiving user input (e.g., via atouch-sensitive display, a touch-sensitive surface, or aphysical/mechanical control such as a knob or a button).

As used in the specification and claims, the term “tactile output”refers to physical displacement of a device relative to a previousposition of the device, physical displacement of a component (e.g., atouch-sensitive surface) of a device relative to another component(e.g., housing) of the device, or displacement of the component relativeto a center of mass of the device that will be detected by a user withthe user's sense of touch. For example, in situations where the deviceor the component of the device is in contact with a surface of a userthat is sensitive to touch (e.g., a finger, palm, or other part of auser's hand), the tactile output generated by the physical displacementwill be interpreted by the user as a tactile sensation corresponding toa perceived change in physical characteristics of the device or thecomponent of the device. For example, movement of a touch-sensitivesurface (e.g., a touch-sensitive display or trackpad) is, optionally,interpreted by the user as a “down click” or “up click” of a physicalactuator button. In some cases, a user will feel a tactile sensationsuch as an “down click” or “up click” even when there is no movement ofa physical actuator button associated with the touch-sensitive surfacethat is physically pressed (e.g., displaced) by the user's movements. Asanother example, movement of the touch-sensitive surface is, optionally,interpreted or sensed by the user as “roughness” of the touch-sensitivesurface, even when there is no change in smoothness of thetouch-sensitive surface. While such interpretations of touch by a userwill be subject to the individualized sensory perceptions of the user,there are many sensory perceptions of touch that are common to a largemajority of users. Thus, when a tactile output is described ascorresponding to a particular sensory perception of a user (e.g., an “upclick,” a “down click,” “roughness”), unless otherwise stated, thegenerated tactile output corresponds to physical displacement of thedevice or a component thereof that will generate the described sensoryperception for a typical (or average) user.

It should be appreciated that device 100 is only one example of aportable multifunction device, and that device 100 optionally has moreor fewer components than shown, optionally combines two or morecomponents, or optionally has a different configuration or arrangementof the components. The various components shown in FIG. 1A areimplemented in hardware, software, or a combination of both hardware andsoftware, including one or more signal processing and/orapplication-specific integrated circuits.

Memory 102 optionally includes high-speed random access memory andoptionally also includes non-volatile memory, such as one or moremagnetic disk storage devices, flash memory devices, or othernon-volatile solid-state memory devices. Memory controller 122optionally controls access to memory 102 by other components of device100.

Peripherals interface 118 can be used to couple input and outputperipherals of the device to CPU 120 and memory 102. The one or moreprocessors 120 run or execute various software programs and/or sets ofinstructions stored in memory 102 to perform various functions fordevice 100 and to process data. In some embodiments, peripheralsinterface 118, CPU 120, and memory controller 122 are, optionally,implemented on a single chip, such as chip 104. In some otherembodiments, they are, optionally, implemented on separate chips.

RF (radio frequency) circuitry 108 receives and sends RF signals, alsocalled electromagnetic signals. RF circuitry 108 converts electricalsignals to/from electromagnetic signals and communicates withcommunications networks and other communications devices via theelectromagnetic signals. RF circuitry 108 optionally includes well-knowncircuitry for performing these functions, including but not limited toan antenna system, an RF transceiver, one or more amplifiers, a tuner,one or more oscillators, a digital signal processor, a CODEC chipset, asubscriber identity module (SIM) card, memory, and so forth. RFcircuitry 108 optionally communicates with networks, such as theInternet, also referred to as the World Wide Web (WWW), an intranetand/or a wireless network, such as a cellular telephone network, awireless local area network (LAN) and/or a metropolitan area network(MAN), and other devices by wireless communication. The RF circuitry 108optionally includes well-known circuitry for detecting near fieldcommunication (NFC) fields, such as by a short-range communicationradio. The wireless communication optionally uses any of a plurality ofcommunications standards, protocols, and technologies, including but notlimited to Global System for Mobile Communications (GSM), Enhanced DataGSM Environment (EDGE), high-speed downlink packet access (HSDPA),high-speed uplink packet access (HSUPA), Evolution, Data-Only (EV-DO),HSPA, HSPA+, Dual-Cell HSPA (DC-HSPDA), long term evolution (LTE), nearfield communication (NFC), wideband code division multiple access(W-CDMA), code division multiple access (CDMA), time division multipleaccess (TDMA), Bluetooth, Bluetooth Low Energy (BTLE), Wireless Fidelity(Wi-Fi) (e.g., IEEE 802.11a, IEEE 802.11b, IEEE 802.11g, IEEE 802.11n,and/or IEEE 802.11ac), voice over Internet Protocol (VoIP), Wi-MAX, aprotocol for e-mail (e.g., Internet message access protocol (IMAP)and/or post office protocol (POP)), instant messaging (e.g., extensiblemessaging and presence protocol (XMPP), Session Initiation Protocol forInstant Messaging and Presence Leveraging Extensions (SIMPLE), InstantMessaging and Presence Service (IMPS)), and/or Short Message Service(SMS), or any other suitable communication protocol, includingcommunication protocols not yet developed as of the filing date of thisdocument.

Audio circuitry 110, speaker 111, and microphone 113 provide an audiointerface between a user and device 100. Audio circuitry 110 receivesaudio data from peripherals interface 118, converts the audio data to anelectrical signal, and transmits the electrical signal to speaker 111.Speaker 111 converts the electrical signal to human-audible sound waves.Audio circuitry 110 also receives electrical signals converted bymicrophone 113 from sound waves. Audio circuitry 110 converts theelectrical signal to audio data and transmits the audio data toperipherals interface 118 for processing. Audio data is, optionally,retrieved from and/or transmitted to memory 102 and/or RF circuitry 108by peripherals interface 118. In some embodiments, audio circuitry 110also includes a headset jack (e.g., 212, FIG. 2). The headset jackprovides an interface between audio circuitry 110 and removable audioinput/output peripherals, such as output-only headphones or a headsetwith both output (e.g., a headphone for one or both ears) and input(e.g., a microphone).

I/O subsystem 106 couples input/output peripherals on device 100, suchas touch screen 112 and other input control devices 116, to peripheralsinterface 118. I/O subsystem 106 optionally includes display controller156, optical sensor controller 158, depth camera controller 169,intensity sensor controller 159, haptic feedback controller 161, and oneor more input controllers 160 for other input or control devices. Theone or more input controllers 160 receive/send electrical signalsfrom/to other input control devices 116. The other input control devices116 optionally include physical buttons (e.g., push buttons, rockerbuttons, etc.), dials, slider switches, joysticks, click wheels, and soforth. In some alternate embodiments, input controller(s) 160 are,optionally, coupled to any (or none) of the following: a keyboard, aninfrared port, a USB port, and a pointer device such as a mouse. The oneor more buttons (e.g., 208, FIG. 2) optionally include an up/down buttonfor volume control of speaker 111 and/or microphone 113. The one or morebuttons optionally include a push button (e.g., 206, FIG. 2).

A quick press of the push button optionally disengages a lock of touchscreen 112 or optionally begins a process that uses gestures on thetouch screen to unlock the device, as described in U.S. patentapplication Ser. No. 11/322,549, “Unlocking a Device by PerformingGestures on an Unlock Image,” filed Dec. 23, 2005, U.S. Pat. No.7,657,849, which is hereby incorporated by reference in its entirety. Alonger press of the push button (e.g., 206) optionally turns power todevice 100 on or off. The functionality of one or more of the buttonsare, optionally, user-customizable. Touch screen 112 is used toimplement virtual or soft buttons and one or more soft keyboards.

Touch-sensitive display 112 provides an input interface and an outputinterface between the device and a user. Display controller 156 receivesand/or sends electrical signals from/to touch screen 112. Touch screen112 displays visual output to the user. The visual output optionallyincludes graphics, text, icons, video, and any combination thereof(collectively termed “graphics”). In some embodiments, some or all ofthe visual output optionally corresponds to user-interface objects.

Touch screen 112 has a touch-sensitive surface, sensor, or set ofsensors that accepts input from the user based on haptic and/or tactilecontact. Touch screen 112 and display controller 156 (along with anyassociated modules and/or sets of instructions in memory 102) detectcontact (and any movement or breaking of the contact) on touch screen112 and convert the detected contact into interaction withuser-interface objects (e.g., one or more soft keys, icons, web pages,or images) that are displayed on touch screen 112. In an exemplaryembodiment, a point of contact between touch screen 112 and the usercorresponds to a finger of the user.

Touch screen 112 optionally uses LCD (liquid crystal display)technology, LPD (light emitting polymer display) technology, or LED(light emitting diode) technology, although other display technologiesare used in other embodiments. Touch screen 112 and display controller156 optionally detect contact and any movement or breaking thereof usingany of a plurality of touch sensing technologies now known or laterdeveloped, including but not limited to capacitive, resistive, infrared,and surface acoustic wave technologies, as well as other proximitysensor arrays or other elements for determining one or more points ofcontact with touch screen 112. In an exemplary embodiment, projectedmutual capacitance sensing technology is used, such as that found in theiPhone® and iPod Touch® from Apple Inc. of Cupertino, Calif.

A touch-sensitive display in some embodiments of touch screen 112 is,optionally, analogous to the multi-touch sensitive touchpads describedin the following U.S. Pat. No. 6,323,846 (Westerman et al.), U.S. Pat.No. 6,570,557 (Westerman et al.), and/or U.S. Pat. No. 6,677,932(Westerman), and/or U.S. Patent Publication 2002/0015024A1, each ofwhich is hereby incorporated by reference in its entirety. However,touch screen 112 displays visual output from device 100, whereastouch-sensitive touchpads do not provide visual output.

A touch-sensitive display in some embodiments of touch screen 112 isdescribed in the following applications: (1) U.S. patent applicationSer. No. 11/381,313, “Multipoint Touch Surface Controller,” filed May 2,2006; (2) U.S. patent application Ser. No. 10/840,862, “MultipointTouchscreen,” filed May 6, 2004; (3) U.S. patent application Ser. No.10/903,964, “Gestures For Touch Sensitive Input Devices,” filed Jul. 30,2004; (4) U.S. patent application Ser. No. 11/048,264, “Gestures ForTouch Sensitive Input Devices,” filed Jan. 31, 2005; (5) U.S. patentapplication Ser. No. 11/038,590, “Mode-Based Graphical User InterfacesFor Touch Sensitive Input Devices,” filed Jan. 18, 2005; (6) U.S. patentapplication Ser. No. 11/228,758, “Virtual Input Device Placement On ATouch Screen User Interface,” filed Sep. 16, 2005; (7) U.S. patentapplication Ser. No. 11/228,700, “Operation Of A Computer With A TouchScreen Interface,” filed Sep. 16, 2005; (8) U.S. patent application Ser.No. 11/228,737, “Activating Virtual Keys Of A Touch-Screen VirtualKeyboard,” filed Sep. 16, 2005; and (9) U.S. patent application Ser. No.11/367,749, “Multi-Functional Hand-Held Device,” filed Mar. 3, 2006. Allof these applications are incorporated by reference herein in theirentirety.

Touch screen 112 optionally has a video resolution in excess of 100 dpi.In some embodiments, the touch screen has a video resolution ofapproximately 160 dpi. The user optionally makes contact with touchscreen 112 using any suitable object or appendage, such as a stylus, afinger, and so forth. In some embodiments, the user interface isdesigned to work primarily with finger-based contacts and gestures,which can be less precise than stylus-based input due to the larger areaof contact of a finger on the touch screen. In some embodiments, thedevice translates the rough finger-based input into a precisepointer/cursor position or command for performing the actions desired bythe user.

In some embodiments, in addition to the touch screen, device 100optionally includes a touchpad for activating or deactivating particularfunctions. In some embodiments, the touchpad is a touch-sensitive areaof the device that, unlike the touch screen, does not display visualoutput. The touchpad is, optionally, a touch-sensitive surface that isseparate from touch screen 112 or an extension of the touch-sensitivesurface formed by the touch screen.

Device 100 also includes power system 162 for powering the variouscomponents. Power system 162 optionally includes a power managementsystem, one or more power sources (e.g., battery, alternating current(AC)), a recharging system, a power failure detection circuit, a powerconverter or inverter, a power status indicator (e.g., a light-emittingdiode (LED)) and any other components associated with the generation,management and distribution of power in portable devices.

Device 100 optionally also includes one or more optical sensors 164.FIG. 1A shows an optical sensor coupled to optical sensor controller 158in I/O subsystem 106. Optical sensor 164 optionally includescharge-coupled device (CCD) or complementary metal-oxide semiconductor(CMOS) phototransistors. Optical sensor 164 receives light from theenvironment, projected through one or more lenses, and converts thelight to data representing an image. In conjunction with imaging module143 (also called a camera module), optical sensor 164 optionallycaptures still images or video. In some embodiments, an optical sensoris located on the back of device 100, opposite touch screen display 112on the front of the device so that the touch screen display is enabledfor use as a viewfinder for still and/or video image acquisition. Insome embodiments, an optical sensor is located on the front of thedevice so that the user's image is, optionally, obtained for videoconferencing while the user views the other video conferenceparticipants on the touch screen display. In some embodiments, theposition of optical sensor 164 can be changed by the user (e.g., byrotating the lens and the sensor in the device housing) so that a singleoptical sensor 164 is used along with the touch screen display for bothvideo conferencing and still and/or video image acquisition.

Device 100 optionally also includes one or more depth camera sensors175. FIG. 1A shows a depth camera sensor coupled to depth cameracontroller 169 in I/O subsystem 106. Depth camera sensor 175 receivesdata from the environment to create a three dimensional model of anobject (e.g., a face) within a scene from a viewpoint (e.g., a depthcamera sensor). In some embodiments, in conjunction with imaging module143 (also called a camera module), depth camera sensor 175 is optionallyused to determine a depth map of different portions of an image capturedby the imaging module 143. In some embodiments, a depth camera sensor islocated on the front of device 100 so that the user's image with depthinformation is, optionally, obtained for video conferencing while theuser views the other video conference participants on the touch screendisplay and to capture selfies with depth map data. In some embodiments,the depth camera sensor 175 is located on the back of device, or on theback and the front of the device 100. In some embodiments, the positionof depth camera sensor 175 can be changed by the user (e.g., by rotatingthe lens and the sensor in the device housing) so that a depth camerasensor 175 is used along with the touch screen display for both videoconferencing and still and/or video image acquisition.

In some embodiments, a depth map (e.g., depth map image) containsinformation (e.g., values) that relates to the distance of objects in ascene from a viewpoint (e.g., a camera, an optical sensor, a depthcamera sensor). In one embodiment of a depth map, each depth pixeldefines the position in the viewpoint's Z-axis where its correspondingtwo-dimensional pixel is located. In some embodiments, a depth map iscomposed of pixels wherein each pixel is defined by a value (e.g.,0-255). For example, the “0” value represents pixels that are located atthe most distant place in a “three dimensional” scene and the “255”value represents pixels that are located closest to a viewpoint (e.g., acamera, an optical sensor, a depth camera sensor) in the “threedimensional” scene. In other embodiments, a depth map represents thedistance between an object in a scene and the plane of the viewpoint. Insome embodiments, the depth map includes information about the relativedepth of various features of an object of interest in view of the depthcamera (e.g., the relative depth of eyes, nose, mouth, ears of a user'sface). In some embodiments, the depth map includes information thatenables the device to determine contours of the object of interest in az direction.

Device 100 optionally also includes one or more contact intensitysensors 165. FIG. 1A shows a contact intensity sensor coupled tointensity sensor controller 159 in I/O subsystem 106. Contact intensitysensor 165 optionally includes one or more piezoresistive strain gauges,capacitive force sensors, electric force sensors, piezoelectric forcesensors, optical force sensors, capacitive touch-sensitive surfaces, orother intensity sensors (e.g., sensors used to measure the force (orpressure) of a contact on a touch-sensitive surface). Contact intensitysensor 165 receives contact intensity information (e.g., pressureinformation or a proxy for pressure information) from the environment.In some embodiments, at least one contact intensity sensor is collocatedwith, or proximate to, a touch-sensitive surface (e.g., touch-sensitivedisplay system 112). In some embodiments, at least one contact intensitysensor is located on the back of device 100, opposite touch screendisplay 112, which is located on the front of device 100.

Device 100 optionally also includes one or more proximity sensors 166.FIG. 1A shows proximity sensor 166 coupled to peripherals interface 118.Alternately, proximity sensor 166 is, optionally, coupled to inputcontroller 160 in I/O subsystem 106. Proximity sensor 166 optionallyperforms as described in U.S. patent application Ser. No. 11/241,839,“Proximity Detector In Handheld Device”; Ser. No. 11/240,788, “ProximityDetector In Handheld Device”; Ser. No. 11/620,702, “Using Ambient LightSensor To Augment Proximity Sensor Output”; Ser. No. 11/586,862,“Automated Response To And Sensing Of User Activity In PortableDevices”; and Ser. No. 11/638,251, “Methods And Systems For AutomaticConfiguration Of Peripherals,” which are hereby incorporated byreference in their entirety. In some embodiments, the proximity sensorturns off and disables touch screen 112 when the multifunction device isplaced near the user's ear (e.g., when the user is making a phone call).

Device 100 optionally also includes one or more tactile outputgenerators 167. FIG. 1A shows a tactile output generator coupled tohaptic feedback controller 161 in I/O subsystem 106. Tactile outputgenerator 167 optionally includes one or more electroacoustic devicessuch as speakers or other audio components and/or electromechanicaldevices that convert energy into linear motion such as a motor,solenoid, electroactive polymer, piezoelectric actuator, electrostaticactuator, or other tactile output generating component (e.g., acomponent that converts electrical signals into tactile outputs on thedevice). Contact intensity sensor 165 receives tactile feedbackgeneration instructions from haptic feedback module 133 and generatestactile outputs on device 100 that are capable of being sensed by a userof device 100. In some embodiments, at least one tactile outputgenerator is collocated with, or proximate to, a touch-sensitive surface(e.g., touch-sensitive display system 112) and, optionally, generates atactile output by moving the touch-sensitive surface vertically (e.g.,in/out of a surface of device 100) or laterally (e.g., back and forth inthe same plane as a surface of device 100). In some embodiments, atleast one tactile output generator sensor is located on the back ofdevice 100, opposite touch screen display 112, which is located on thefront of device 100.

Device 100 optionally also includes one or more accelerometers 168. FIG.1A shows accelerometer 168 coupled to peripherals interface 118.Alternately, accelerometer 168 is, optionally, coupled to an inputcontroller 160 in I/O subsystem 106. Accelerometer 168 optionallyperforms as described in U.S. Patent Publication No. 20050190059,“Acceleration-based Theft Detection System for Portable ElectronicDevices,” and U.S. Patent Publication No. 20060017692, “Methods AndApparatuses For Operating A Portable Device Based On An Accelerometer,”both of which are incorporated by reference herein in their entirety. Insome embodiments, information is displayed on the touch screen displayin a portrait view or a landscape view based on an analysis of datareceived from the one or more accelerometers. Device 100 optionallyincludes, in addition to accelerometer(s) 168, a magnetometer and a GPS(or GLONASS or other global navigation system) receiver for obtaininginformation concerning the location and orientation (e.g., portrait orlandscape) of device 100.

In some embodiments, the software components stored in memory 102include operating system 126, communication module (or set ofinstructions) 128, contact/motion module (or set of instructions) 130,graphics module (or set of instructions) 132, text input module (or setof instructions) 134, Global Positioning System (GPS) module (or set ofinstructions) 135, and applications (or sets of instructions) 136.Furthermore, in some embodiments, memory 102 (FIG. 1A) or 370 (FIG. 3)stores device/global internal state 157, as shown in FIGS. 1A and 3.Device/global internal state 157 includes one or more of: activeapplication state, indicating which applications, if any, are currentlyactive; display state, indicating what applications, views or otherinformation occupy various regions of touch screen display 112; sensorstate, including information obtained from the device's various sensorsand input control devices 116; and location information concerning thedevice's location and/or attitude.

Operating system 126 (e.g., Darwin, RTXC, LINUX, UNIX, OS X, iOS,WINDOWS, or an embedded operating system such as VxWorks) includesvarious software components and/or drivers for controlling and managinggeneral system tasks (e.g., memory management, storage device control,power management, etc.) and facilitates communication between varioushardware and software components.

Communication module 128 facilitates communication with other devicesover one or more external ports 124 and also includes various softwarecomponents for handling data received by RF circuitry 108 and/orexternal port 124. External port 124 (e.g., Universal Serial Bus (USB),FIREWIRE, etc.) is adapted for coupling directly to other devices orindirectly over a network (e.g., the Internet, wireless LAN, etc.). Insome embodiments, the external port is a multi-pin (e.g., 30-pin)connector that is the same as, or similar to and/or compatible with, the30-pin connector used on iPod® (trademark of Apple Inc.) devices.

Contact/motion module 130 optionally detects contact with touch screen112 (in conjunction with display controller 156) and othertouch-sensitive devices (e.g., a touchpad or physical click wheel).Contact/motion module 130 includes various software components forperforming various operations related to detection of contact, such asdetermining if contact has occurred (e.g., detecting a finger-downevent), determining an intensity of the contact (e.g., the force orpressure of the contact or a substitute for the force or pressure of thecontact), determining if there is movement of the contact and trackingthe movement across the touch-sensitive surface (e.g., detecting one ormore finger-dragging events), and determining if the contact has ceased(e.g., detecting a finger-up event or a break in contact).Contact/motion module 130 receives contact data from the touch-sensitivesurface. Determining movement of the point of contact, which isrepresented by a series of contact data, optionally includes determiningspeed (magnitude), velocity (magnitude and direction), and/or anacceleration (a change in magnitude and/or direction) of the point ofcontact. These operations are, optionally, applied to single contacts(e.g., one finger contacts) or to multiple simultaneous contacts (e.g.,“multitouch”/multiple finger contacts). In some embodiments,contact/motion module 130 and display controller 156 detect contact on atouchpad.

In some embodiments, contact/motion module 130 uses a set of one or moreintensity thresholds to determine whether an operation has beenperformed by a user (e.g., to determine whether a user has “clicked” onan icon). In some embodiments, at least a subset of the intensitythresholds are determined in accordance with software parameters (e.g.,the intensity thresholds are not determined by the activation thresholdsof particular physical actuators and can be adjusted without changingthe physical hardware of device 100). For example, a mouse “click”threshold of a trackpad or touch screen display can be set to any of alarge range of predefined threshold values without changing the trackpador touch screen display hardware. Additionally, in some implementations,a user of the device is provided with software settings for adjustingone or more of the set of intensity thresholds (e.g., by adjustingindividual intensity thresholds and/or by adjusting a plurality ofintensity thresholds at once with a system-level click “intensity”parameter).

Contact/motion module 130 optionally detects a gesture input by a user.Different gestures on the touch-sensitive surface have different contactpatterns (e.g., different motions, timings, and/or intensities ofdetected contacts). Thus, a gesture is, optionally, detected bydetecting a particular contact pattern. For example, detecting a fingertap gesture includes detecting a finger-down event followed by detectinga finger-up (liftoff) event at the same position (or substantially thesame position) as the finger-down event (e.g., at the position of anicon). As another example, detecting a finger swipe gesture on thetouch-sensitive surface includes detecting a finger-down event followedby detecting one or more finger-dragging events, and subsequentlyfollowed by detecting a finger-up (liftoff) event.

Graphics module 132 includes various known software components forrendering and displaying graphics on touch screen 112 or other display,including components for changing the visual impact (e.g., brightness,transparency, saturation, contrast, or other visual property) ofgraphics that are displayed. As used herein, the term “graphics”includes any object that can be displayed to a user, including, withoutlimitation, text, web pages, icons (such as user-interface objectsincluding soft keys), digital images, videos, animations, and the like.

In some embodiments, graphics module 132 stores data representinggraphics to be used. Each graphic is, optionally, assigned acorresponding code. Graphics module 132 receives, from applicationsetc., one or more codes specifying graphics to be displayed along with,if necessary, coordinate data and other graphic property data, and thengenerates screen image data to output to display controller 156.

Haptic feedback module 133 includes various software components forgenerating instructions used by tactile output generator(s) 167 toproduce tactile outputs at one or more locations on device 100 inresponse to user interactions with device 100.

Text input module 134, which is, optionally, a component of graphicsmodule 132, provides soft keyboards for entering text in variousapplications (e.g., contacts 137, e-mail 140, IM 141, browser 147, andany other application that needs text input).

GPS module 135 determines the location of the device and provides thisinformation for use in various applications (e.g., to telephone 138 foruse in location-based dialing; to camera 143 as picture/video metadata;and to applications that provide location-based services such as weatherwidgets, local yellow page widgets, and map/navigation widgets).

Applications 136 optionally include the following modules (or sets ofinstructions), or a subset or superset thereof:

-   -   Contacts module 137 (sometimes called an address book or contact        list);    -   Telephone module 138;    -   Video conference module 139;    -   E-mail client module 140;    -   Instant messaging (IM) module 141;    -   Workout support module 142;    -   Camera module 143 for still and/or video images;    -   Image management module 144;    -   Video player module;    -   Music player module;    -   Browser module 147;    -   Calendar module 148;    -   Widget modules 149, which optionally include one or more of:        weather widget 149-1, stocks widget 149-2, calculator widget        149-3, alarm clock widget 149-4, dictionary widget 149-5, and        other widgets obtained by the user, as well as user-created        widgets 149-6;    -   Widget creator module 150 for making user-created widgets 149-6;    -   Search module 151;    -   Video and music player module 152, which merges video player        module and music player module;    -   Notes module 153;    -   Map module 154; and/or    -   Online video module 155.

Examples of other applications 136 that are, optionally, stored inmemory 102 include other word processing applications, other imageediting applications, drawing applications, presentation applications,JAVA-enabled applications, encryption, digital rights management, voicerecognition, and voice replication.

In conjunction with touch screen 112, display controller 156,contact/motion module 130, graphics module 132, and text input module134, contacts module 137 are, optionally, used to manage an address bookor contact list (e.g., stored in application internal state 192 ofcontacts module 137 in memory 102 or memory 370), including: addingname(s) to the address book; deleting name(s) from the address book;associating telephone number(s), e-mail address(es), physicaladdress(es) or other information with a name; associating an image witha name; categorizing and sorting names; providing telephone numbers ore-mail addresses to initiate and/or facilitate communications bytelephone 138, video conference module 139, e-mail 140, or IM 141; andso forth.

In conjunction with RF circuitry 108, audio circuitry 110, speaker 111,microphone 113, touch screen 112, display controller 156, contact/motionmodule 130, graphics module 132, and text input module 134, telephonemodule 138 are optionally, used to enter a sequence of characterscorresponding to a telephone number, access one or more telephonenumbers in contacts module 137, modify a telephone number that has beenentered, dial a respective telephone number, conduct a conversation, anddisconnect or hang up when the conversation is completed. As notedabove, the wireless communication optionally uses any of a plurality ofcommunications standards, protocols, and technologies.

In conjunction with RF circuitry 108, audio circuitry 110, speaker 111,microphone 113, touch screen 112, display controller 156, optical sensor164, optical sensor controller 158, contact/motion module 130, graphicsmodule 132, text input module 134, contacts module 137, and telephonemodule 138, video conference module 139 includes executable instructionsto initiate, conduct, and terminate a video conference between a userand one or more other participants in accordance with user instructions.

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, and textinput module 134, e-mail client module 140 includes executableinstructions to create, send, receive, and manage e-mail in response touser instructions. In conjunction with image management module 144,e-mail client module 140 makes it very easy to create and send e-mailswith still or video images taken with camera module 143.

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, and textinput module 134, the instant messaging module 141 includes executableinstructions to enter a sequence of characters corresponding to aninstant message, to modify previously entered characters, to transmit arespective instant message (for example, using a Short Message Service(SMS) or Multimedia Message Service (MMS) protocol for telephony-basedinstant messages or using XMPP, SIMPLE, or IMPS for Internet-basedinstant messages), to receive instant messages, and to view receivedinstant messages. In some embodiments, transmitted and/or receivedinstant messages optionally include graphics, photos, audio files, videofiles and/or other attachments as are supported in an MMS and/or anEnhanced Messaging Service (EMS). As used herein, “instant messaging”refers to both telephony-based messages (e.g., messages sent using SMSor MMS) and Internet-based messages (e.g., messages sent using XMPP,SIMPLE, or IMPS).

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, textinput module 134, GPS module 135, map module 154, and music playermodule, workout support module 142 includes executable instructions tocreate workouts (e.g., with time, distance, and/or calorie burninggoals); communicate with workout sensors (sports devices); receiveworkout sensor data; calibrate sensors used to monitor a workout; selectand play music for a workout; and display, store, and transmit workoutdata.

In conjunction with touch screen 112, display controller 156, opticalsensor(s) 164, optical sensor controller 158, contact/motion module 130,graphics module 132, and image management module 144, camera module 143includes executable instructions to capture still images or video(including a video stream) and store them into memory 102, modifycharacteristics of a still image or video, or delete a still image orvideo from memory 102.

In conjunction with touch screen 112, display controller 156,contact/motion module 130, graphics module 132, text input module 134,and camera module 143, image management module 144 includes executableinstructions to arrange, modify (e.g., edit), or otherwise manipulate,label, delete, present (e.g., in a digital slide show or album), andstore still and/or video images.

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, and textinput module 134, browser module 147 includes executable instructions tobrowse the Internet in accordance with user instructions, includingsearching, linking to, receiving, and displaying web pages or portionsthereof, as well as attachments and other files linked to web pages.

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, textinput module 134, e-mail client module 140, and browser module 147,calendar module 148 includes executable instructions to create, display,modify, and store calendars and data associated with calendars (e.g.,calendar entries, to-do lists, etc.) in accordance with userinstructions.

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, textinput module 134, and browser module 147, widget modules 149 aremini-applications that are, optionally, downloaded and used by a user(e.g., weather widget 149-1, stocks widget 149-2, calculator widget149-3, alarm clock widget 149-4, and dictionary widget 149-5) or createdby the user (e.g., user-created widget 149-6). In some embodiments, awidget includes an HTML (Hypertext Markup Language) file, a CSS(Cascading Style Sheets) file, and a JavaScript file. In someembodiments, a widget includes an XML (Extensible Markup Language) fileand a JavaScript file (e.g., Yahoo! Widgets).

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, textinput module 134, and browser module 147, the widget creator module 150are, optionally, used by a user to create widgets (e.g., turning auser-specified portion of a web page into a widget).

In conjunction with touch screen 112, display controller 156,contact/motion module 130, graphics module 132, and text input module134, search module 151 includes executable instructions to search fortext, music, sound, image, video, and/or other files in memory 102 thatmatch one or more search criteria (e.g., one or more user-specifiedsearch terms) in accordance with user instructions.

In conjunction with touch screen 112, display controller 156,contact/motion module 130, graphics module 132, audio circuitry 110,speaker 111, RF circuitry 108, and browser module 147, video and musicplayer module 152 includes executable instructions that allow the userto download and play back recorded music and other sound files stored inone or more file formats, such as MP3 or AAC files, and executableinstructions to display, present, or otherwise play back videos (e.g.,on touch screen 112 or on an external, connected display via externalport 124). In some embodiments, device 100 optionally includes thefunctionality of an MP3 player, such as an iPod (trademark of AppleInc.).

In conjunction with touch screen 112, display controller 156,contact/motion module 130, graphics module 132, and text input module134, notes module 153 includes executable instructions to create andmanage notes, to-do lists, and the like in accordance with userinstructions.

In conjunction with RF circuitry 108, touch screen 112, displaycontroller 156, contact/motion module 130, graphics module 132, textinput module 134, GPS module 135, and browser module 147, map module 154are, optionally, used to receive, display, modify, and store maps anddata associated with maps (e.g., driving directions, data on stores andother points of interest at or near a particular location, and otherlocation-based data) in accordance with user instructions.

In conjunction with touch screen 112, display controller 156,contact/motion module 130, graphics module 132, audio circuitry 110,speaker 111, RF circuitry 108, text input module 134, e-mail clientmodule 140, and browser module 147, online video module 155 includesinstructions that allow the user to access, browse, receive (e.g., bystreaming and/or download), play back (e.g., on the touch screen or onan external, connected display via external port 124), send an e-mailwith a link to a particular online video, and otherwise manage onlinevideos in one or more file formats, such as H.264. In some embodiments,instant messaging module 141, rather than e-mail client module 140, isused to send a link to a particular online video. Additional descriptionof the online video application can be found in U.S. Provisional PatentApplication No. 60/936,562, “Portable Multifunction Device, Method, andGraphical User Interface for Playing Online Videos,” filed Jun. 20,2007, and U.S. patent application Ser. No. 11/968,067, “PortableMultifunction Device, Method, and Graphical User Interface for PlayingOnline Videos,” filed Dec. 31, 2007, the contents of which are herebyincorporated by reference in their entirety.

Each of the above-identified modules and applications corresponds to aset of executable instructions for performing one or more functionsdescribed above and the methods described in this application (e.g., thecomputer-implemented methods and other information processing methodsdescribed herein). These modules (e.g., sets of instructions) need notbe implemented as separate software programs, procedures, or modules,and thus various subsets of these modules are, optionally, combined orotherwise rearranged in various embodiments. For example, video playermodule is, optionally, combined with music player module into a singlemodule (e.g., video and music player module 152, FIG. 1A). In someembodiments, memory 102 optionally stores a subset of the modules anddata structures identified above. Furthermore, memory 102 optionallystores additional modules and data structures not described above.

In some embodiments, device 100 is a device where operation of apredefined set of functions on the device is performed exclusivelythrough a touch screen and/or a touchpad. By using a touch screen and/ora touchpad as the primary input control device for operation of device100, the number of physical input control devices (such as push buttons,dials, and the like) on device 100 is, optionally, reduced.

The predefined set of functions that are performed exclusively through atouch screen and/or a touchpad optionally include navigation betweenuser interfaces. In some embodiments, the touchpad, when touched by theuser, navigates device 100 to a main, home, or root menu from any userinterface that is displayed on device 100. In such embodiments, a “menubutton” is implemented using a touchpad. In some other embodiments, themenu button is a physical push button or other physical input controldevice instead of a touchpad.

FIG. 1B is a block diagram illustrating exemplary components for eventhandling in accordance with some embodiments. In some embodiments,memory 102 (FIG. 1A) or 370 (FIG. 3) includes event sorter 170 (e.g., inoperating system 126) and a respective application 136-1 (e.g., any ofthe aforementioned applications 137-151, 155, 380-390).

Event sorter 170 receives event information and determines theapplication 136-1 and application view 191 of application 136-1 to whichto deliver the event information. Event sorter 170 includes eventmonitor 171 and event dispatcher module 174. In some embodiments,application 136-1 includes application internal state 192, whichindicates the current application view(s) displayed on touch-sensitivedisplay 112 when the application is active or executing. In someembodiments, device/global internal state 157 is used by event sorter170 to determine which application(s) is (are) currently active, andapplication internal state 192 is used by event sorter 170 to determineapplication views 191 to which to deliver event information.

In some embodiments, application internal state 192 includes additionalinformation, such as one or more of: resume information to be used whenapplication 136-1 resumes execution, user interface state informationthat indicates information being displayed or that is ready for displayby application 136-1, a state queue for enabling the user to go back toa prior state or view of application 136-1, and a redo/undo queue ofprevious actions taken by the user.

Event monitor 171 receives event information from peripherals interface118. Event information includes information about a sub-event (e.g., auser touch on touch-sensitive display 112, as part of a multi-touchgesture). Peripherals interface 118 transmits information it receivesfrom I/O subsystem 106 or a sensor, such as proximity sensor 166,accelerometer(s) 168, and/or microphone 113 (through audio circuitry110). Information that peripherals interface 118 receives from I/Osubsystem 106 includes information from touch-sensitive display 112 or atouch-sensitive surface.

In some embodiments, event monitor 171 sends requests to the peripheralsinterface 118 at predetermined intervals. In response, peripheralsinterface 118 transmits event information. In other embodiments,peripherals interface 118 transmits event information only when there isa significant event (e.g., receiving an input above a predeterminednoise threshold and/or for more than a predetermined duration).

In some embodiments, event sorter 170 also includes a hit viewdetermination module 172 and/or an active event recognizer determinationmodule 173.

Hit view determination module 172 provides software procedures fordetermining where a sub-event has taken place within one or more viewswhen touch-sensitive display 112 displays more than one view. Views aremade up of controls and other elements that a user can see on thedisplay.

Another aspect of the user interface associated with an application is aset of views, sometimes herein called application views or userinterface windows, in which information is displayed and touch-basedgestures occur. The application views (of a respective application) inwhich a touch is detected optionally correspond to programmatic levelswithin a programmatic or view hierarchy of the application. For example,the lowest level view in which a touch is detected is, optionally,called the hit view, and the set of events that are recognized as properinputs are, optionally, determined based, at least in part, on the hitview of the initial touch that begins a touch-based gesture.

Hit view determination module 172 receives information related tosub-events of a touch-based gesture. When an application has multipleviews organized in a hierarchy, hit view determination module 172identifies a hit view as the lowest view in the hierarchy which shouldhandle the sub-event. In most circumstances, the hit view is the lowestlevel view in which an initiating sub-event occurs (e.g., the firstsub-event in the sequence of sub-events that form an event or potentialevent). Once the hit view is identified by the hit view determinationmodule 172, the hit view typically receives all sub-events related tothe same touch or input source for which it was identified as the hitview.

Active event recognizer determination module 173 determines which viewor views within a view hierarchy should receive a particular sequence ofsub-events. In some embodiments, active event recognizer determinationmodule 173 determines that only the hit view should receive a particularsequence of sub-events. In other embodiments, active event recognizerdetermination module 173 determines that all views that include thephysical location of a sub-event are actively involved views, andtherefore determines that all actively involved views should receive aparticular sequence of sub-events. In other embodiments, even if touchsub-events were entirely confined to the area associated with oneparticular view, views higher in the hierarchy would still remain asactively involved views.

Event dispatcher module 174 dispatches the event information to an eventrecognizer (e.g., event recognizer 180). In embodiments including activeevent recognizer determination module 173, event dispatcher module 174delivers the event information to an event recognizer determined byactive event recognizer determination module 173. In some embodiments,event dispatcher module 174 stores in an event queue the eventinformation, which is retrieved by a respective event receiver 182.

In some embodiments, operating system 126 includes event sorter 170.Alternatively, application 136-1 includes event sorter 170. In yet otherembodiments, event sorter 170 is a stand-alone module, or a part ofanother module stored in memory 102, such as contact/motion module 130.

In some embodiments, application 136-1 includes a plurality of eventhandlers 190 and one or more application views 191, each of whichincludes instructions for handling touch events that occur within arespective view of the application's user interface. Each applicationview 191 of the application 136-1 includes one or more event recognizers180. Typically, a respective application view 191 includes a pluralityof event recognizers 180. In other embodiments, one or more of eventrecognizers 180 are part of a separate module, such as a user interfacekit or a higher level object from which application 136-1 inheritsmethods and other properties. In some embodiments, a respective eventhandler 190 includes one or more of: data updater 176, object updater177, GUI updater 178, and/or event data 179 received from event sorter170. Event handler 190 optionally utilizes or calls data updater 176,object updater 177, or GUI updater 178 to update the applicationinternal state 192. Alternatively, one or more of the application views191 include one or more respective event handlers 190. Also, in someembodiments, one or more of data updater 176, object updater 177, andGUI updater 178 are included in a respective application view 191.

A respective event recognizer 180 receives event information (e.g.,event data 179) from event sorter 170 and identifies an event from theevent information. Event recognizer 180 includes event receiver 182 andevent comparator 184. In some embodiments, event recognizer 180 alsoincludes at least a subset of: metadata 183, and event deliveryinstructions 188 (which optionally include sub-event deliveryinstructions).

Event receiver 182 receives event information from event sorter 170. Theevent information includes information about a sub-event, for example, atouch or a touch movement. Depending on the sub-event, the eventinformation also includes additional information, such as location ofthe sub-event. When the sub-event concerns motion of a touch, the eventinformation optionally also includes speed and direction of thesub-event. In some embodiments, events include rotation of the devicefrom one orientation to another (e.g., from a portrait orientation to alandscape orientation, or vice versa), and the event informationincludes corresponding information about the current orientation (alsocalled device attitude) of the device.

Event comparator 184 compares the event information to predefined eventor sub-event definitions and, based on the comparison, determines anevent or sub-event, or determines or updates the state of an event orsub-event. In some embodiments, event comparator 184 includes eventdefinitions 186. Event definitions 186 contain definitions of events(e.g., predefined sequences of sub-events), for example, event 1(187-1), event 2 (187-2), and others. In some embodiments, sub-events inan event (187) include, for example, touch begin, touch end, touchmovement, touch cancellation, and multiple touching. In one example, thedefinition for event 1 (187-1) is a double tap on a displayed object.The double tap, for example, comprises a first touch (touch begin) onthe displayed object for a predetermined phase, a first liftoff (touchend) for a predetermined phase, a second touch (touch begin) on thedisplayed object for a predetermined phase, and a second liftoff (touchend) for a predetermined phase. In another example, the definition forevent 2 (187-2) is a dragging on a displayed object. The dragging, forexample, comprises a touch (or contact) on the displayed object for apredetermined phase, a movement of the touch across touch-sensitivedisplay 112, and liftoff of the touch (touch end). In some embodiments,the event also includes information for one or more associated eventhandlers 190.

In some embodiments, event definition 187 includes a definition of anevent for a respective user-interface object. In some embodiments, eventcomparator 184 performs a hit test to determine which user-interfaceobject is associated with a sub-event. For example, in an applicationview in which three user-interface objects are displayed ontouch-sensitive display 112, when a touch is detected on touch-sensitivedisplay 112, event comparator 184 performs a hit test to determine whichof the three user-interface objects is associated with the touch(sub-event). If each displayed object is associated with a respectiveevent handler 190, the event comparator uses the result of the hit testto determine which event handler 190 should be activated. For example,event comparator 184 selects an event handler associated with thesub-event and the object triggering the hit test.

In some embodiments, the definition for a respective event (187) alsoincludes delayed actions that delay delivery of the event informationuntil after it has been determined whether the sequence of sub-eventsdoes or does not correspond to the event recognizer's event type.

When a respective event recognizer 180 determines that the series ofsub-events do not match any of the events in event definitions 186, therespective event recognizer 180 enters an event impossible, eventfailed, or event ended state, after which it disregards subsequentsub-events of the touch-based gesture. In this situation, other eventrecognizers, if any, that remain active for the hit view continue totrack and process sub-events of an ongoing touch-based gesture.

In some embodiments, a respective event recognizer 180 includes metadata183 with configurable properties, flags, and/or lists that indicate howthe event delivery system should perform sub-event delivery to activelyinvolved event recognizers. In some embodiments, metadata 183 includesconfigurable properties, flags, and/or lists that indicate how eventrecognizers interact, or are enabled to interact, with one another. Insome embodiments, metadata 183 includes configurable properties, flags,and/or lists that indicate whether sub-events are delivered to varyinglevels in the view or programmatic hierarchy.

In some embodiments, a respective event recognizer 180 activates eventhandler 190 associated with an event when one or more particularsub-events of an event are recognized. In some embodiments, a respectiveevent recognizer 180 delivers event information associated with theevent to event handler 190. Activating an event handler 190 is distinctfrom sending (and deferred sending) sub-events to a respective hit view.In some embodiments, event recognizer 180 throws a flag associated withthe recognized event, and event handler 190 associated with the flagcatches the flag and performs a predefined process.

In some embodiments, event delivery instructions 188 include sub-eventdelivery instructions that deliver event information about a sub-eventwithout activating an event handler. Instead, the sub-event deliveryinstructions deliver event information to event handlers associated withthe series of sub-events or to actively involved views. Event handlersassociated with the series of sub-events or with actively involved viewsreceive the event information and perform a predetermined process.

In some embodiments, data updater 176 creates and updates data used inapplication 136-1. For example, data updater 176 updates the telephonenumber used in contacts module 137, or stores a video file used in videoplayer module. In some embodiments, object updater 177 creates andupdates objects used in application 136-1. For example, object updater177 creates a new user-interface object or updates the position of auser-interface object. GUI updater 178 updates the GUI. For example, GUIupdater 178 prepares display information and sends it to graphics module132 for display on a touch-sensitive display.

In some embodiments, event handler(s) 190 includes or has access to dataupdater 176, object updater 177, and GUI updater 178. In someembodiments, data updater 176, object updater 177, and GUI updater 178are included in a single module of a respective application 136-1 orapplication view 191. In other embodiments, they are included in two ormore software modules.

It shall be understood that the foregoing discussion regarding eventhandling of user touches on touch-sensitive displays also applies toother forms of user inputs to operate multifunction devices 100 withinput devices, not all of which are initiated on touch screens. Forexample, mouse movement and mouse button presses, optionally coordinatedwith single or multiple keyboard presses or holds; contact movementssuch as taps, drags, scrolls, etc. on touchpads; pen stylus inputs;movement of the device; oral instructions; detected eye movements;biometric inputs; and/or any combination thereof are optionally utilizedas inputs corresponding to sub-events which define an event to berecognized.

FIG. 2 illustrates a portable multifunction device 100 having a touchscreen 112 in accordance with some embodiments. The touch screenoptionally displays one or more graphics within user interface (UI) 200.In this embodiment, as well as others described below, a user is enabledto select one or more of the graphics by making a gesture on thegraphics, for example, with one or more fingers 202 (not drawn to scalein the figure) or one or more styluses 203 (not drawn to scale in thefigure). In some embodiments, selection of one or more graphics occurswhen the user breaks contact with the one or more graphics. In someembodiments, the gesture optionally includes one or more taps, one ormore swipes (from left to right, right to left, upward and/or downward),and/or a rolling of a finger (from right to left, left to right, upwardand/or downward) that has made contact with device 100. In someimplementations or circumstances, inadvertent contact with a graphicdoes not select the graphic. For example, a swipe gesture that sweepsover an application icon optionally does not select the correspondingapplication when the gesture corresponding to selection is a tap.

Device 100 optionally also include one or more physical buttons, such as“home” or menu button 204. As described previously, menu button 204 is,optionally, used to navigate to any application 136 in a set ofapplications that are, optionally, executed on device 100.Alternatively, in some embodiments, the menu button is implemented as asoft key in a GUI displayed on touch screen 112.

In some embodiments, device 100 includes touch screen 112, menu button204, push button 206 for powering the device on/off and locking thedevice, volume adjustment button(s) 208, subscriber identity module(SIM) card slot 210, headset jack 212, and docking/charging externalport 124. Push button 206 is, optionally, used to turn the power on/offon the device by depressing the button and holding the button in thedepressed state for a predefined time interval; to lock the device bydepressing the button and releasing the button before the predefinedtime interval has elapsed; and/or to unlock the device or initiate anunlock process. In an alternative embodiment, device 100 also acceptsverbal input for activation or deactivation of some functions throughmicrophone 113. Device 100 also, optionally, includes one or morecontact intensity sensors 165 for detecting intensity of contacts ontouch screen 112 and/or one or more tactile output generators 167 forgenerating tactile outputs for a user of device 100.

FIG. 3 is a block diagram of an exemplary multifunction device with adisplay and a touch-sensitive surface in accordance with someembodiments. Device 300 need not be portable. In some embodiments,device 300 is a laptop computer, a desktop computer, a tablet computer,a multimedia player device, a navigation device, an educational device(such as a child's learning toy), a gaming system, or a control device(e.g., a home or industrial controller). Device 300 typically includesone or more processing units (CPUs) 310, one or more network or othercommunications interfaces 360, memory 370, and one or more communicationbuses 320 for interconnecting these components. Communication buses 320optionally include circuitry (sometimes called a chipset) thatinterconnects and controls communications between system components.Device 300 includes input/output (I/O) interface 330 comprising display340, which is typically a touch screen display. I/O interface 330 alsooptionally includes a keyboard and/or mouse (or other pointing device)350 and touchpad 355, tactile output generator 357 for generatingtactile outputs on device 300 (e.g., similar to tactile outputgenerator(s) 167 described above with reference to FIG. 1A), sensors 359(e.g., optical, acceleration, proximity, touch-sensitive, and/or contactintensity sensors similar to contact intensity sensor(s) 165 describedabove with reference to FIG. 1A). Memory 370 includes high-speed randomaccess memory, such as DRAM, SRAM, DDR RAM, or other random access solidstate memory devices; and optionally includes non-volatile memory, suchas one or more magnetic disk storage devices, optical disk storagedevices, flash memory devices, or other non-volatile solid state storagedevices. Memory 370 optionally includes one or more storage devicesremotely located from CPU(s) 310. In some embodiments, memory 370 storesprograms, modules, and data structures analogous to the programs,modules, and data structures stored in memory 102 of portablemultifunction device 100 (FIG. 1A), or a subset thereof. Furthermore,memory 370 optionally stores additional programs, modules, and datastructures not present in memory 102 of portable multifunction device100. For example, memory 370 of device 300 optionally stores drawingmodule 380, presentation module 382, word processing module 384, websitecreation module 386, disk authoring module 388, and/or spreadsheetmodule 390, while memory 102 of portable multifunction device 100 (FIG.1A) optionally does not store these modules.

Each of the above-identified elements in FIG. 3 is, optionally, storedin one or more of the previously mentioned memory devices. Each of theabove-identified modules corresponds to a set of instructions forperforming a function described above. The above-identified modules orprograms (e.g., sets of instructions) need not be implemented asseparate software programs, procedures, or modules, and thus varioussubsets of these modules are, optionally, combined or otherwiserearranged in various embodiments. In some embodiments, memory 370optionally stores a subset of the modules and data structures identifiedabove. Furthermore, memory 370 optionally stores additional modules anddata structures not described above.

Attention is now directed towards embodiments of user interfaces thatare, optionally, implemented on, for example, portable multifunctiondevice 100.

FIG. 4A illustrates an exemplary user interface for a menu ofapplications on portable multifunction device 100 in accordance withsome embodiments. Similar user interfaces are, optionally, implementedon device 300. In some embodiments, user interface 400 includes thefollowing elements, or a subset or superset thereof:

-   -   Signal strength indicator(s) 402 for wireless communication(s),        such as cellular and Wi-Fi signals;    -   Time 404;    -   Bluetooth indicator 405;    -   Battery status indicator 406;    -   Tray 408 with icons for frequently used applications, such as:        -   Icon 416 for telephone module 138, labeled “Phone,” which            optionally includes an indicator 414 of the number of missed            calls or voicemail messages;        -   Icon 418 for e-mail client module 140, labeled “Mail,” which            optionally includes an indicator 410 of the number of unread            e-mails;        -   Icon 420 for browser module 147, labeled “Browser;” and        -   Icon 422 for video and music player module 152, also            referred to as iPod (trademark of Apple Inc.) module 152,            labeled “iPod;” and    -   Icons for other applications, such as:        -   Icon 424 for IM module 141, labeled “Messages;”        -   Icon 426 for calendar module 148, labeled “Calendar;”        -   Icon 428 for image management module 144, labeled “Photos;”        -   Icon 430 for camera module 143, labeled “Camera;”        -   Icon 432 for online video module 155, labeled “Online            Video;”        -   Icon 434 for stocks widget 149-2, labeled “Stocks;”        -   Icon 436 for map module 154, labeled “Maps;”        -   Icon 438 for weather widget 149-1, labeled “Weather;”        -   Icon 440 for alarm clock widget 149-4, labeled “Clock;”        -   Icon 442 for workout support module 142, labeled “Workout            Support;”        -   Icon 444 for notes module 153, labeled “Notes;” and        -   Icon 446 for a settings application or module, labeled            “Settings,” which provides access to settings for device 100            and its various applications 136.

It should be noted that the icon labels illustrated in FIG. 4A aremerely exemplary. For example, icon 422 for video and music playermodule 152 is labeled “Music” or “Music Player.” Other labels are,optionally, used for various application icons. In some embodiments, alabel for a respective application icon includes a name of anapplication corresponding to the respective application icon. In someembodiments, a label for a particular application icon is distinct froma name of an application corresponding to the particular applicationicon.

FIG. 4B illustrates an exemplary user interface on a device (e.g.,device 300, FIG. 3) with a touch-sensitive surface 451 (e.g., a tabletor touchpad 355, FIG. 3) that is separate from the display 450 (e.g.,touch screen display 112). Device 300 also, optionally, includes one ormore contact intensity sensors (e.g., one or more of sensors 359) fordetecting intensity of contacts on touch-sensitive surface 451 and/orone or more tactile output generators 357 for generating tactile outputsfor a user of device 300.

Although some of the examples that follow will be given with referenceto inputs on touch screen display 112 (where the touch-sensitive surfaceand the display are combined), in some embodiments, the device detectsinputs on a touch-sensitive surface that is separate from the display,as shown in FIG. 4B. In some embodiments, the touch-sensitive surface(e.g., 451 in FIG. 4B) has a primary axis (e.g., 452 in FIG. 4B) thatcorresponds to a primary axis (e.g., 453 in FIG. 4B) on the display(e.g., 450). In accordance with these embodiments, the device detectscontacts (e.g., 460 and 462 in FIG. 4B) with the touch-sensitive surface451 at locations that correspond to respective locations on the display(e.g., in FIG. 4B, 460 corresponds to 468 and 462 corresponds to 470).In this way, user inputs (e.g., contacts 460 and 462, and movementsthereof) detected by the device on the touch-sensitive surface (e.g.,451 in FIG. 4B) are used by the device to manipulate the user interfaceon the display (e.g., 450 in FIG. 4B) of the multifunction device whenthe touch-sensitive surface is separate from the display. It should beunderstood that similar methods are, optionally, used for other userinterfaces described herein.

Additionally, while the following examples are given primarily withreference to finger inputs (e.g., finger contacts, finger tap gestures,finger swipe gestures), it should be understood that, in someembodiments, one or more of the finger inputs are replaced with inputfrom another input device (e.g., a mouse-based input or stylus input).For example, a swipe gesture is, optionally, replaced with a mouse click(e.g., instead of a contact) followed by movement of the cursor alongthe path of the swipe (e.g., instead of movement of the contact). Asanother example, a tap gesture is, optionally, replaced with a mouseclick while the cursor is located over the location of the tap gesture(e.g., instead of detection of the contact followed by ceasing to detectthe contact). Similarly, when multiple user inputs are simultaneouslydetected, it should be understood that multiple computer mice are,optionally, used simultaneously, or a mouse and finger contacts are,optionally, used simultaneously.

FIG. 5A illustrates exemplary personal electronic device 500. Device 500includes body 502. In some embodiments, device 500 can include some orall of the features described with respect to devices 100 and 300 (e.g.,FIGS. 1A-4B). In some embodiments, device 500 has touch-sensitivedisplay screen 504, hereafter touch screen 504. Alternatively, or inaddition to touch screen 504, device 500 has a display and atouch-sensitive surface. As with devices 100 and 300, in someembodiments, touch screen 504 (or the touch-sensitive surface)optionally includes one or more intensity sensors for detectingintensity of contacts (e.g., touches) being applied. The one or moreintensity sensors of touch screen 504 (or the touch-sensitive surface)can provide output data that represents the intensity of touches. Theuser interface of device 500 can respond to touches based on theirintensity, meaning that touches of different intensities can invokedifferent user interface operations on device 500.

Exemplary techniques for detecting and processing touch intensity arefound, for example, in related applications: International PatentApplication Serial No. PCT/US2013/040061, titled “Device, Method, andGraphical User Interface for Displaying User Interface ObjectsCorresponding to an Application,” filed May 8, 2013, published as WIPOPublication No. WO/2013/169849, and International Patent ApplicationSerial No. PCT/US2013/069483, titled “Device, Method, and Graphical UserInterface for Transitioning Between Touch Input to Display OutputRelationships,” filed Nov. 11, 2013, published as WIPO Publication No.WO/2014/105276, each of which is hereby incorporated by reference intheir entirety.

In some embodiments, device 500 has one or more input mechanisms 506 and508. Input mechanisms 506 and 508, if included, can be physical.Examples of physical input mechanisms include push buttons and rotatablemechanisms. In some embodiments, device 500 has one or more attachmentmechanisms. Such attachment mechanisms, if included, can permitattachment of device 500 with, for example, hats, eyewear, earrings,necklaces, shirts, jackets, bracelets, watch straps, chains, trousers,belts, shoes, purses, backpacks, and so forth. These attachmentmechanisms permit device 500 to be worn by a user.

FIG. 5B depicts exemplary personal electronic device 500. In someembodiments, device 500 can include some or all of the componentsdescribed with respect to FIGS. 1A, 1B, and 3. Device 500 has bus 512that operatively couples I/O section 514 with one or more computerprocessors 516 and memory 518. I/O section 514 can be connected todisplay 504, which can have touch-sensitive component 522 and,optionally, intensity sensor 524 (e.g., contact intensity sensor). Inaddition, I/O section 514 can be connected with communication unit 530for receiving application and operating system data, using Wi-Fi,Bluetooth, near field communication (NFC), cellular, and/or otherwireless communication techniques. Device 500 can include inputmechanisms 506 and/or 508. Input mechanism 506 is, optionally, arotatable input device or a depressible and rotatable input device, forexample. Input mechanism 508 is, optionally, a button, in some examples.

Input mechanism 508 is, optionally, a microphone, in some examples.Personal electronic device 500 optionally includes various sensors, suchas GPS sensor 532, accelerometer 534, directional sensor 540 (e.g.,compass), gyroscope 536, motion sensor 538, and/or a combinationthereof, all of which can be operatively connected to I/O section 514.

Memory 518 of personal electronic device 500 can include one or morenon-transitory computer-readable storage mediums, for storingcomputer-executable instructions, which, when executed by one or morecomputer processors 516, for example, can cause the computer processorsto perform the techniques described below, including processes 700, 900,1100, 1300, and 1500 (FIGS. 7A-7B, 9A-9B, 11A-11B, 13A-13B, and15A-15B). A computer-readable storage medium can be any medium that cantangibly contain or store computer-executable instructions for use by orin connection with the instruction execution system, apparatus, ordevice. In some examples, the storage medium is a transitorycomputer-readable storage medium. In some examples, the storage mediumis a non-transitory computer-readable storage medium. The non-transitorycomputer-readable storage medium can include, but is not limited to,magnetic, optical, and/or semiconductor storages. Examples of suchstorage include magnetic disks, optical discs based on CD, DVD, orBlu-ray technologies, as well as persistent solid-state memory such asflash, solid-state drives, and the like. Personal electronic device 500is not limited to the components and configuration of FIG. 5B, but caninclude other or additional components in multiple configurations.

As used here, the term “affordance” refers to a user-interactivegraphical user interface object that is, optionally, displayed on thedisplay screen of devices 100, 300, and/or 500 (FIGS. 1A, 3, and 5A-5B).For example, an image (e.g., icon), a button, and text (e.g., hyperlink)each optionally constitute an affordance.

As used herein, the term “focus selector” refers to an input elementthat indicates a current part of a user interface with which a user isinteracting. In some implementations that include a cursor or otherlocation marker, the cursor acts as a “focus selector” so that when aninput (e.g., a press input) is detected on a touch-sensitive surface(e.g., touchpad 355 in FIG. 3 or touch-sensitive surface 451 in FIG. 4B)while the cursor is over a particular user interface element (e.g., abutton, window, slider, or other user interface element), the particularuser interface element is adjusted in accordance with the detectedinput. In some implementations that include a touch screen display(e.g., touch-sensitive display system 112 in FIG. 1A or touch screen 112in FIG. 4A) that enables direct interaction with user interface elementson the touch screen display, a detected contact on the touch screen actsas a “focus selector” so that when an input (e.g., a press input by thecontact) is detected on the touch screen display at a location of aparticular user interface element (e.g., a button, window, slider, orother user interface element), the particular user interface element isadjusted in accordance with the detected input. In some implementations,focus is moved from one region of a user interface to another region ofthe user interface without corresponding movement of a cursor ormovement of a contact on a touch screen display (e.g., by using a tabkey or arrow keys to move focus from one button to another button); inthese implementations, the focus selector moves in accordance withmovement of focus between different regions of the user interface.Without regard to the specific form taken by the focus selector, thefocus selector is generally the user interface element (or contact on atouch screen display) that is controlled by the user so as tocommunicate the user's intended interaction with the user interface(e.g., by indicating, to the device, the element of the user interfacewith which the user is intending to interact). For example, the locationof a focus selector (e.g., a cursor, a contact, or a selection box) overa respective button while a press input is detected on thetouch-sensitive surface (e.g., a touchpad or touch screen) will indicatethat the user is intending to activate the respective button (as opposedto other user interface elements shown on a display of the device).

As used in the specification and claims, the term “characteristicintensity” of a contact refers to a characteristic of the contact basedon one or more intensities of the contact. In some embodiments, thecharacteristic intensity is based on multiple intensity samples. Thecharacteristic intensity is, optionally, based on a predefined number ofintensity samples, or a set of intensity samples collected during apredetermined time period (e.g., 0.05, 0.1, 0.2, 0.5, 1, 2, 5, 10seconds) relative to a predefined event (e.g., after detecting thecontact, prior to detecting liftoff of the contact, before or afterdetecting a start of movement of the contact, prior to detecting an endof the contact, before or after detecting an increase in intensity ofthe contact, and/or before or after detecting a decrease in intensity ofthe contact). A characteristic intensity of a contact is, optionally,based on one or more of: a maximum value of the intensities of thecontact, a mean value of the intensities of the contact, an averagevalue of the intensities of the contact, a top 10 percentile value ofthe intensities of the contact, a value at the half maximum of theintensities of the contact, a value at the 90 percent maximum of theintensities of the contact, or the like. In some embodiments, theduration of the contact is used in determining the characteristicintensity (e.g., when the characteristic intensity is an average of theintensity of the contact over time). In some embodiments, thecharacteristic intensity is compared to a set of one or more intensitythresholds to determine whether an operation has been performed by auser. For example, the set of one or more intensity thresholdsoptionally includes a first intensity threshold and a second intensitythreshold. In this example, a contact with a characteristic intensitythat does not exceed the first threshold results in a first operation, acontact with a characteristic intensity that exceeds the first intensitythreshold and does not exceed the second intensity threshold results ina second operation, and a contact with a characteristic intensity thatexceeds the second threshold results in a third operation. In someembodiments, a comparison between the characteristic intensity and oneor more thresholds is used to determine whether or not to perform one ormore operations (e.g., whether to perform a respective operation orforgo performing the respective operation), rather than being used todetermine whether to perform a first operation or a second operation.

In some embodiments, a portion of a gesture is identified for purposesof determining a characteristic intensity. For example, atouch-sensitive surface optionally receives a continuous swipe contacttransitioning from a start location and reaching an end location, atwhich point the intensity of the contact increases. In this example, thecharacteristic intensity of the contact at the end location is,optionally, based on only a portion of the continuous swipe contact, andnot the entire swipe contact (e.g., only the portion of the swipecontact at the end location). In some embodiments, a smoothing algorithmis, optionally, applied to the intensities of the swipe contact prior todetermining the characteristic intensity of the contact. For example,the smoothing algorithm optionally includes one or more of: anunweighted sliding-average smoothing algorithm, a triangular smoothingalgorithm, a median filter smoothing algorithm, and/or an exponentialsmoothing algorithm. In some circumstances, these smoothing algorithmseliminate narrow spikes or dips in the intensities of the swipe contactfor purposes of determining a characteristic intensity.

The intensity of a contact on the touch-sensitive surface is,optionally, characterized relative to one or more intensity thresholds,such as a contact-detection intensity threshold, a light press intensitythreshold, a deep press intensity threshold, and/or one or more otherintensity thresholds. In some embodiments, the light press intensitythreshold corresponds to an intensity at which the device will performoperations typically associated with clicking a button of a physicalmouse or a trackpad. In some embodiments, the deep press intensitythreshold corresponds to an intensity at which the device will performoperations that are different from operations typically associated withclicking a button of a physical mouse or a trackpad. In someembodiments, when a contact is detected with a characteristic intensitybelow the light press intensity threshold (e.g., and above a nominalcontact-detection intensity threshold below which the contact is nolonger detected), the device will move a focus selector in accordancewith movement of the contact on the touch-sensitive surface withoutperforming an operation associated with the light press intensitythreshold or the deep press intensity threshold. Generally, unlessotherwise stated, these intensity thresholds are consistent betweendifferent sets of user interface figures.

An increase of characteristic intensity of the contact from an intensitybelow the light press intensity threshold to an intensity between thelight press intensity threshold and the deep press intensity thresholdis sometimes referred to as a “light press” input. An increase ofcharacteristic intensity of the contact from an intensity below the deeppress intensity threshold to an intensity above the deep press intensitythreshold is sometimes referred to as a “deep press” input. An increaseof characteristic intensity of the contact from an intensity below thecontact-detection intensity threshold to an intensity between thecontact-detection intensity threshold and the light press intensitythreshold is sometimes referred to as detecting the contact on thetouch-surface. A decrease of characteristic intensity of the contactfrom an intensity above the contact-detection intensity threshold to anintensity below the contact-detection intensity threshold is sometimesreferred to as detecting liftoff of the contact from the touch-surface.In some embodiments, the contact-detection intensity threshold is zero.In some embodiments, the contact-detection intensity threshold isgreater than zero.

In some embodiments described herein, one or more operations areperformed in response to detecting a gesture that includes a respectivepress input or in response to detecting the respective press inputperformed with a respective contact (or a plurality of contacts), wherethe respective press input is detected based at least in part ondetecting an increase in intensity of the contact (or plurality ofcontacts) above a press-input intensity threshold. In some embodiments,the respective operation is performed in response to detecting theincrease in intensity of the respective contact above the press-inputintensity threshold (e.g., a “down stroke” of the respective pressinput). In some embodiments, the press input includes an increase inintensity of the respective contact above the press-input intensitythreshold and a subsequent decrease in intensity of the contact belowthe press-input intensity threshold, and the respective operation isperformed in response to detecting the subsequent decrease in intensityof the respective contact below the press-input threshold (e.g., an “upstroke” of the respective press input).

In some embodiments, the device employs intensity hysteresis to avoidaccidental inputs sometimes termed “jitter,” where the device defines orselects a hysteresis intensity threshold with a predefined relationshipto the press-input intensity threshold (e.g., the hysteresis intensitythreshold is X intensity units lower than the press-input intensitythreshold or the hysteresis intensity threshold is 75%, 90%, or somereasonable proportion of the press-input intensity threshold). Thus, insome embodiments, the press input includes an increase in intensity ofthe respective contact above the press-input intensity threshold and asubsequent decrease in intensity of the contact below the hysteresisintensity threshold that corresponds to the press-input intensitythreshold, and the respective operation is performed in response todetecting the subsequent decrease in intensity of the respective contactbelow the hysteresis intensity threshold (e.g., an “up stroke” of therespective press input). Similarly, in some embodiments, the press inputis detected only when the device detects an increase in intensity of thecontact from an intensity at or below the hysteresis intensity thresholdto an intensity at or above the press-input intensity threshold and,optionally, a subsequent decrease in intensity of the contact to anintensity at or below the hysteresis intensity, and the respectiveoperation is performed in response to detecting the press input (e.g.,the increase in intensity of the contact or the decrease in intensity ofthe contact, depending on the circumstances).

For ease of explanation, the descriptions of operations performed inresponse to a press input associated with a press-input intensitythreshold or in response to a gesture including the press input are,optionally, triggered in response to detecting either: an increase inintensity of a contact above the press-input intensity threshold, anincrease in intensity of a contact from an intensity below thehysteresis intensity threshold to an intensity above the press-inputintensity threshold, a decrease in intensity of the contact below thepress-input intensity threshold, and/or a decrease in intensity of thecontact below the hysteresis intensity threshold corresponding to thepress-input intensity threshold. Additionally, in examples where anoperation is described as being performed in response to detecting adecrease in intensity of a contact below the press-input intensitythreshold, the operation is, optionally, performed in response todetecting a decrease in intensity of the contact below a hysteresisintensity threshold corresponding to, and lower than, the press-inputintensity threshold.

Attention is now directed towards embodiments of user interfaces (“UI”)and associated processes that are implemented on an electronic device,such as portable multifunction device 100, device 300, or device 500.

FIGS. 6A-6BQ illustrate exemplary user interfaces for displaying visualeffects in a messaging application, in accordance with some embodiments.The user interfaces in these figures are used to illustrate theprocesses described below, including the processes in FIGS. 7A-7B.

FIG. 6A depicts device 600 having display 601, which in some cases is atouch-sensitive display. In some embodiments, device 600 also includescamera 602, which, at a minimum, includes an image sensor that iscapable of capturing data representing a portion of the light spectrum(e.g., visible light, infrared light, or ultraviolet light). In someembodiments, camera 602 includes multiple image sensors and/or othertypes of sensors. In addition to capturing data representing sensedlight, in some embodiments, camera 602 is capable of capturing othertypes of data, such as depth data. For example, in some embodiments,camera 602 also captures depth data using techniques based on speckle,time-of-flight, parallax, or focus. Image data that device 600 capturesusing camera 602 includes data corresponding to a portion of the lightspectrum for a scene within the field of view of the camera.Additionally, in some embodiments, the captured image data also includesdepth data for the light data. In some other embodiments, the capturedimage data contains data sufficient to determine or generate depth datafor the data for the portion of the light spectrum. In some embodiments,device 600 includes one or more features of devices 100, 300, or 500.

In some examples, electronic device 600 includes a depth camera, such asan infrared camera, a thermographic camera, or a combination thereof. Insome examples, the device further includes a light-emitting device(e.g., light projector), such an IR flood light, a structured lightprojector, or a combination thereof. The light-emitting device is,optionally, used to illuminate the subject during capture of the imageby a visible light camera and a depth camera (e.g., an IR camera) andthe information from the depth camera and the visible light camera areused to determine a depth map of different portions of subject capturedby the visible light camera. In some embodiments, a depth map (e.g.,depth map image) contains information (e.g., values) that relates to thedistance of objects in a scene from a viewpoint (e.g., a camera). In oneembodiment of a depth map, each depth pixel defines the position in theviewpoint's Z-axis where its corresponding two-dimensional pixel islocated. In some examples, a depth map is composed of pixels whereineach pixel is defined by a value (e.g., 0-255). For example, the “0”value represents pixels that are located at the most distant place in a“three dimensional” scene and the “255” value represents pixels that arelocated closest to a viewpoint (e.g., camera) in the “three dimensional”scene. In other examples, a depth map represents the distance between anobject in a scene and the plane of the viewpoint.) In some embodiments,the depth map includes information about the relative depth of variousfeatures of an object of interest in view of the depth camera (e.g., therelative depth of eyes, nose, mouth, ears of a user's face). In someembodiments, the depth map includes information that enables the deviceto determine contours of the object of interest in a z direction. Insome embodiments, the lighting effects described herein are displayedusing disparity information from two cameras (e.g., two visual lightcameras) for rear facing images and using depth information from a depthcamera combined with image data from a visual light camera for frontfacing images (e.g., selfie images). In some embodiments, the same userinterface is used when the two visual light cameras are used todetermine the depth information and when the depth camera is used todetermine the depth information, providing the user with a consistentexperience, even when using dramatically different technologies todetermine the information that is used when generating the lightingeffects. In some embodiments, while displaying the camera user interfacewith one of the lighting effects applied, the device detects selectionof a camera switching affordance and switches from the front facingcameras (e.g., a depth camera and a visible light camera) to therear-facing cameras (e.g., two visible light cameras that are spacedapart from each other) (or vice versa) while maintaining display of theuser interface controls for applying the lighting effect and replacingdisplay of the field of view of the front facing cameras to the field ofview of the rear facing cameras (or vice versa).

In FIG. 6A, device 600 is displaying messaging user interface 603 of amessaging application. Messaging user interface 603 includes messagedisplay region 604 including messages 605 transmitted to a participant(represented by recipient identifier 606) in a message conversation.Messaging user interface 603 also includes a message-compose field 608for displaying input (e.g., text input, multimedia input, etc.) forsending to the participant in the message conversation. Messaging userinterface 603 also includes camera application affordance 609,application dock affordance 610, keyboard display region 612, andtext-suggestion region 614.

In FIG. 6B, device 600 detects input 616 (e.g., a touch input on display601) at a location corresponding to camera application affordance 609.

In FIG. 6C, in response to detecting input 616, device 600 launches acamera application associated with camera application affordance 609 anddisplays camera application user interface 615. Camera application userinterface 615 includes image display region 620 which displays arepresentation of image data such as, for example, streamed image data(e.g., a live camera preview, live camera recording, or live videocommunications session) representing objects positioned within afield-of-view of a camera (e.g., a rear-facing camera or camera 602), ora media item such as, for example, a photograph or a video recording. Inthe embodiment illustrated in FIG. 6C, image display region 620 showslive camera preview 620-1′ from a rear-facing camera of device 600.

Camera application user interface 615 also includes a region above imagedisplay region 620 that includes camera-specific affordances 617 anddone affordance 618 for exiting camera application user interface 615.Camera-specific affordances include affordance 617-1 associated with acamera flash function, affordance 617-2 associated with a camera modefunction, affordance 617-3 associated with a timer function, andaffordance 617-4 associated with a filter function.

Camera application user interface 615 also includes camera optionsregion 625 positioned below image display region 620. Camera optionsregion 625 includes camera selector affordance 627 for switching betweencameras (e.g., a rear-facing camera and camera 602), and camera optionaffordances 619 associated with different capture modes in which acamera can record image data. For example, video affordance 619-1 isassociated with a function for activating a video recording capture modeof the camera, and photo affordance 619-2 is associated with a functionfor activating a still image capture mode of the camera. In theembodiments discussed below with respect to FIGS. 6C-6AX, device 600 isin the still image capture mode of operation associated with photoaffordance 619-2. However, unless specified otherwise, these embodimentsalso apply to the video recording mode associated with video affordance619-1.

Camera options region 625 further includes effects affordance 622 forenabling and disabling a mode of device 600 in which device 600 isenabled or disabled for displaying visual effects in image displayregion 620. This mode of device 600 is often referred to herein as aneffects mode.

Camera options region 625 also includes capture affordance 621, whichcan be selected to capture image data represented in image displayregion 620. In some embodiments, device 600 captures the image data in amanner based on the currently enabled capture mode (e.g., videorecording capture mode or image capture mode). In some embodiments,device 600 captures the image data depending on the type of gesturedetected on capture affordance 621. For example, if device 600 detects atap gesture on capture affordance 621, device 600 captures a still imageof the image data represented in image display region 620 at the timethe tap gesture occurs. If device 600 detects a tap-and-hold gesture oncapture affordance 621, device 600 captures a video recording of theimage data represented in image display region 620 during a period oftime for which the tap-and-hold gesture persists. In some embodiments,the video recording stops when the finger lifts off of the affordance.In some embodiments, the video recording continues until a subsequentinput (e.g., a tap input) is detected at a location corresponding to theaffordance. In some embodiments, the captured image (e.g., still imageor video recording) is then inserted into message-compose field 608 tobe subsequently sent to a participant in the message conversation. Insome embodiments, the captured image is sent directly to the participantin the message conversation without inserting the captured image inmessage-compose field 608.

In FIG. 6D, device 600 detects input 623 (e.g., a tap gesture on display601) on effects affordance 622.

In FIG. 6E, in response to detecting input 623, device 600 activatescamera 602 (e.g., switches from the rear-facing camera) and updatesimage display region 620 to display live camera preview 620-1 fromcamera 602, showing a representation of subject 632 positioned in thefield-of-view of camera 602 and background 636 displayed behind subject632. As discussed herein, image data captured using camera 602 includes,in some embodiments, depth data that can be used to determine a depth ofobjects in the field-of-view of camera 602. In some embodiments, device600 parses objects (e.g., in image data) based on a detected depth ofthose objects, and uses this determination to apply the visual effectsdiscussed herein. For example, device 600 can categorize subject 632 asbeing in the foreground of the live camera preview 620-1 and objectspositioned behind the user as being in the background of the live camerapreview 620-1. These background objects are referred to generally hereinas background 636.

Device 600 also highlights effects affordance 622 to indicate visualeffects are enabled for display, and updates camera options region 625by replacing camera option affordances 619 with visual effects optionaffordances 624. The visual effects option affordances include avatareffects affordance 624-1 and sticker effects affordance 624-2. Visualeffects option affordances 624 correspond to different visual effectsthat can be applied to the image displayed in image display region 620.By selecting one of the visual effect option affordances (e.g., 624-1 or624-2) a menu is displayed with visual effects options corresponding tothe selected visual effects option affordance.

A user can activate or deactivate the effects mode of device 600 byselecting effects affordance 622. When effects affordance 622 ishighlighted, an effects mode of device 600 is enabled to display visualeffects in image display region 620. If a user taps on highlightedaffordance 622, effects affordance 622 is no longer highlighted, and theeffects mode is disabled such that visual effects are not enabled fordisplay in image display region 620. In some embodiments, when theeffects mode is enabled, device 600 updates the image shown in imagedisplay region 620 to display one or more visual effects that have beenapplied to the image (including visual effects that are applied to alive image stream) and, when the effects mode is disabled, device 600removes or hides the visual effects from the image shown in imagedisplay region 620.

In FIG. 6F, device 600 detects input 626 (e.g., a tap gesture on display601) on avatar effects affordance 624-1.

In FIG. 6G, in response to detecting input 626, device 600 displaysavatar options menu 628 with a scrollable listing of avatar options 630.Avatar options menu 628 also includes selection region 629 forindicating a selected one of avatar options 630. As shown in FIG. 6G,robot avatar option 630-3 is positioned in selection region 629, whichindicates robot avatar option 630-1 is selected. Other avatar options630 shown in FIG. 6G include customized woman avatar option 630-1, nullavatar option 630-2, rabbit avatar option 630-4, and alien avatar option630-5. As discussed in detail below, null avatar option 630-2corresponds to an avatar option for which no avatar is displayed on thesubject in image display region 620. In some embodiments, null avataroption 630-2 is the default selected avatar option when avatar optionsmenu 628 is displayed.

Avatar options 630 correspond to a virtual avatar visual effect appliedto a representation of the subject in image display region 620.Specifically, each avatar option 630 corresponds to a virtual avatarthat, when selected, is transposed onto the face of the subject in theimage display region, while other portions of the image in image displayregion (such as a background or other portions of the user, such astheir body) remain displayed. A user (e.g., subject 632) positioned inthe field-of-view of camera 602 can control visual aspects of thevirtual avatar by changing the pose (e.g., rotation or orientation) oftheir face, including moving various facial features (e.g., winking,sticking out their tongue, smiling, etc.). Details for controllingdisplay and movement of virtual avatars is provided in U.S. patentapplication Ser. No. 15/870,195, which is hereby incorporated byreference for all purposes.

In some embodiments, a virtual avatar is a representation of the userthat can be graphically depicted (e.g., a graphical representation ofthe user). In some embodiments, the virtual avatar is non-photorealistic(e.g., is cartoonish). In some embodiments, the virtual avatar includesan avatar face having one or more avatar features (e.g., avatar facialfeatures). In some embodiments, the avatar features correspond (e.g.,are mapped) to one or more physical features of a user's face such thatdetected movement of the user's physical features (e.g., as determinedbased on a camera such as a depth sensing camera) affects the avatarfeature (e.g., affects the feature's graphical representation).

In some examples, a user is able to manipulate characteristics orfeatures of a virtual avatar using a camera sensor (e.g., camera module143, optical sensor 164) and, optionally, a depth sensor (e.g., depthcamera sensor 175). As a user's physical features (such as facialfeatures) and position (such as head position, head rotation, or headtilt) changes, the electronic device detects the changes and modifiesthe displayed image of the virtual avatar to reflect the changes in theuser's physical features and position. In some embodiments, the changesto the user's physical features and position are indicative of variousexpressions, emotions, context, tone, or other non-verbal communication.In some embodiments, the electronic device modifies the displayed imageof the virtual avatar to represent these expressions, emotions, context,tone, or other non-verbal communication.

In some embodiments, the virtual avatars are customizable avatars (e.g.,customizable avatar 835). Customizable avatars are virtual avatars thatcan be selected and customized by a user, for example, to achieve adesired appearance (e.g., to look like the user). The customizableavatars generally have an appearance of a human character, rather than anon-human character such as an anthropomorphic construct of an animal orother nonhuman object. Additionally, features of the avatar can becreated or changed, if desired, using an avatar editing user interface(e.g., such as the avatar editing user interface discussed below withrespect to FIGS. 8AA-8AB). In some embodiments, customizable avatars canbe created and configured to achieve a customized physical appearance,physical construct, or modeled behavior.

In some embodiments, the virtual avatars are non-customizable avatars.Non-customizable avatars are virtual avatars that can be selected by auser, but generally are not fundamentally configurable, though theirappearance can be altered via face tracking, as described in more detailbelow. Instead, non-customizable avatars are preconfigured and generallydo not have feature components that can be modified by a user. In someembodiments, the non-customizable avatars have an appearance of anon-human character, such as an anthropomorphic construct of an animalor other nonhuman object (e.g., see robot avatar 633, rabbit avatar634). Non-customizable avatars cannot be created by a user or modifiedto achieve a significant change in the physical appearance, physicalconstruct, or modeled behavior of non-customizable avatars.

Because robot avatar 630-3 is selected in FIG. 6G, device 600 displaysrobot avatar 633 positioned on the face of subject 632 (e.g., the user)displayed in image display region 620. Device 600 displays the avatar sothat it remains fixed on the subject's face as he moves within thefield-of-view of camera 602. Device 600 also continually modifies theavatar based on any detected changes in the user's face, including poseof the face or changes in facial expressions, while maintaining thedisplayed subject's body and background 636 in the image shown in imagedisplay region 620.

As shown in FIG. 6H, if device 600 detects that the user's head movesbeyond the field-of-view of camera 602, device 600 displays, forexample, prompt 638 containing representation 640 of the selected avatar(e.g., robot avatar) and message 642 instructing the user to repositiontheir head in the field-of-view of camera 602. In some embodiments,prompt 638 is displayed with a blurred background 644 (includingblurring the user 632 and background 636). In some embodiments,displaying prompt 638 includes displaying an animation of the selectedavatar returning to a center location of image display region 620, andshowing a slowed movement of the avatar and its features as they appearto settle to a stop based on a physics model of the avatar features.

Once device 600 detects the user's head returning to the field-of-viewof camera 602, device continues updating the selected avatar (e.g.,robot avatar 633) based on changes detected in the user's face, as shownin FIG. 6I. In some embodiments, when device 600 detects the user's facereturning to the field-of-view of camera 602, device 600 displays avatar633 moving from the center position of image display region 620 to theposition of the user's face, and resumes modifying the avatar based ondetected changes to the user's face.

In FIGS. 6I-6K, device 600 detects input 644 (e.g., a scrolling gestureon display 601) on the displayed list of avatar options 630. As input644 moves in a leftward direction across display 601, device 600displays avatar options 630 scrolling to the left to reveal additionalavatar options (e.g., a poop avatar option and a fox avatar option). Insome embodiments, as one of the avatar options enters selection region629, device produces a haptic feedback (e.g., a tactile output, with orwithout an audio output) to indicate to the user that termination ofinput 644 at that time would result in selection of the avatar optioncurrently positioned within selection region 629. For example, in FIG.6K, device 600 produces haptic feedback 646 when alien avatar option630-5 is positioned within selection region 629.

In FIG. 6J, device 600 detects termination of input 644 and determinesalien avatar option 630-5 is positioned within selection region 629. Asa result, device 600 replaces robot avatar 633 with alien avatar 635corresponding to selected alien avatar option 630-5, while maintainingdisplay of the subject's body and background 636. In some embodiments,replacing the previously selected avatar (e.g., robot avatar 633) withthe currently selected avatar (e.g., alien avatar 635) includesdisplaying an animation of the previously selected avatar movingoff-screen (e.g., out of image display region 620) and an animation ofcurrently selected avatar moving from avatar options menu 628 to theuser's face in image display region 620. In some embodiments, facetracking of the currently selected avatar begins as the avatar movesfrom avatar options menu 628 to the user's face. In some embodiments,the background is blurred while the avatar switching operation occurs.

In FIGS. 6M-6N, device 600 detects a vertical gesture 646 (e.g., avertical swipe or touch-and-drag on display 601) on avatar options menu628 and, in response, enlarges avatar options menu 628 to enlarged view628-1, as shown in FIG. 6N. In FIG. 6N, avatar options 630 are arrangedin a matrix and can be selected as shown in FIG. 6O (e.g., gesture 648selects robot avatar option 630-3) to display a different avatar inimage display region 620.

In FIG. 6P, in response to gesture 648, device 600 shows (e.g., byde-enlarging the meu) the avatar options menu 628 with selected robotavatar option 630-3 (positioned in selection region 629) and robotavatar 633 displayed on the user's head in image display region 620.

In FIG. 6Q, device 600 detects input 649 (e.g., a tap gesture on display601) on close icon 650 to close application options menu 628 and returnto the camera user interface 615 shown in FIG. 6R, showing cameraoptions region 625, while maintaining display of robot avatar 633 on theuser's face in an image stream (e.g., live camera preview) on imagedisplay region 620. Camera options region 625 shows capture affordance621, visual effects option affordances 624, and highlighted effectsaffordance 622. The user can select avatar menu affordance 624-1 toagain display avatar options menu 628, as shown below in FIGS. 6S and6T.

For example, in FIG. 6S, device 600 detects input 652 (e.g., tap gestureon display 601) on avatar effects affordance 624-1. In FIG. 6T, inresponse to detecting input 652, device 600 displays avatar options menu628 having avatar options 630, which replaces camera options region 625,including capture affordance 621, effects affordance 622, and visualeffects option affordances 624. The avatar options menu 628 can beclosed again by selecting close affordance 650, as previously discussed,returning again to the camera application user interface 615 as shown inFIG. 6U.

In FIG. 6U, device 600 detects input 654 (e.g., tap gesture on display601) on sticker effects affordance 624-2. In FIG. 6V, in response todetecting input 654, device 600 displays sticker options menu 656 havinga scrollable listing of stickers 658. The stickers are static graphicalobjects that may be selected by a user and applied to the image in imagedisplay region 620. In some embodiments, a sticker can be selected by atouch-and-drag gesture that initiates on the selected sticker, drags thesticker to a desired location, and then places the sticker at thedesired location by terminating the drag (e.g., lifting finger). In someembodiments a sticker can be selected by a tap gesture, and thecorresponding sticker is then displayed at a position on the imagedisplay region 620. An example of such an embodiment is shown in FIGS.6W-6X, showing tap gesture 660 on helmet sticker 658-1, which is thendisplayed in the center of image display region 620.

FIGS. 6Y-6Z show that a placed sticker (e.g., sticker 658-1) can berepositioned on the image display region 620 by dragging the sticker toa new location (see drag gesture 662 moving sticker 658-1 to a newposition in FIG. 6Z).

FIG. 6AA shows that a placed sticker (e.g., sticker 658-1) can berotated using two-finger gesture 664.

FIG. 6AB shows that a placed sticker (e.g., sticker 658-1) can beresized using a pinch or de-pinch gesture. In FIG. 6AB, helmet sticker658-1 is enlarged in response to de-pinch gesture 666. The resized,rotated, and repositioned helmet sticker 658-1 is shown placed on robotavatar's head in FIG. 6AC.

In FIG. 6AD, device 600 detects input 668 (e.g. a tap gesture on display601) on close icon 670 to close sticker options menu 656 and return tothe camera application user interface 615 as shown in FIG. 6AE, whichincludes camera options region 625 showing capture affordance 621,visual effects option affordances 624, and highlighted effectsaffordance 622 indicating visual effects are enabled for display. InFIG. 6AE, device 600 shows the representation of the user in the imageon image display region 620, but modified with visual effects thatinclude robot avatar 633 (e.g., selected from avatar options menu 628)and helmet sticker 658-1 (e.g., selected from sticker options menu 656).The display of the visual effects is dynamic. For example, device 600continuously modifies robot avatar 633 in response to detected changesin the user's face. In some embodiments, positioning of sticker 658-1 isalso dynamic and changing based on the detected movements of the user,as discussed in greater detail below.

When visual effects are enabled for device 600 (e.g., effects affordance622 is shown highlighted), applied visual effects (such as avatars andstickers, for example) can be removed or hidden from image displayregion 620 by un-selecting highlighted effects affordance 622 (e.g.,selecting effects affordance 622 when it is highlighted to disablevisual effects). For example, in FIG. 6AF, device detects input 672(e.g., a tap gesture on display 601) on highlighted effects affordance622. In response, device 600 removes the displayed visual effects (e.g.,helmet sticker 658-1 and robot avatar 633 in FIG. 6AF) from imagedisplay region 620, as shown in FIG. 6AG, while maintaining display ofthe image stream (e.g., live camera preview 620-1) including subject 632and background 636 in the field-of-view of camera 602.

In some embodiments, after the visual effects mode is disabled (e.g., byun-selecting highlighted effects affordance 622), the removed visualeffects can be restored, for example, by reselecting effects affordance622 within a predetermined amount of time. For example, in FIG. 6AH,device detects input 674 (e.g., a tap gesture on display 601) onnon-highlighted effects affordance 622. In response to detecting input674, device 600 highlights effects affordance 622 and re-displays helmetsticker 658-1 and robot avatar 633, while continuing to display theimage stream (e.g., live camera preview 620-1) including the body ofsubject 632 and background 636, as shown in FIG. 6AI.

In FIG. 6AJ, device 600 detects input 676 (e.g., a tap gesture ondisplay 601) on capture affordance 621. In response to detecting input676, device 600 captures an image of the live camera preview 620-1 inFIG. 6AJ, and displays, in FIG. 6AK, media item 620-2 (e.g., a stillimage of the state of live camera preview 620-1 at the time device 600detected input 676 on capture affordance 621).

In some embodiments, when the image is captured (e.g., stored as a mediaitem), device 600 encodes depth data into the media item. Storing thedepth data in the media permits the later application of depth-basedeffects (e.g., effects based on the location of objects (e.g., theuser's face) in the z direction). In some embodiments, when the image iscaptured while an effect is applied, the effect is directly encoded inthe visual (e.g., RGB) information for improved compatibility with otherdevices.

In FIG. 6AK, device 600 displays camera application user interface 615showing the captured image (e.g., media item 620-2) in the image displayregion 620 (e.g., media item 620-2 replaces the live camera preview620-1 shown in image display region 620). Media item 620-2 is arepresentation of the live camera preview 620-1 at the time the captureaffordance 621 was selected. Thus, media item 620-2 includes helmetsticker 658-1 and robot avatar 633 displayed over the face of subject632, and background 636.

Device 600 also replaces the camera-specific affordances (e.g.,affordances 617 shown in FIG. 6AJ) with retake affordance 679 andrecipient identifier 606, which indicates a currently selected recipientof media item 620-2, should the user subsequently send the media item asdiscussed below.

Device 600 displays camera options region 625, including visual effectsoption affordances 624. Visual effects option affordances 624 can beselected to display their respective option menus, which can be used tomodify captured media item 620-2 (as well as recorded video media item620-4 discussed below).

Device 600 also updates camera options region 625 to replace captureaffordance 621 and camera selector affordance 627 with markup affordance677, edit affordance 678, and send affordance 680. Markup affordance 677allows a user to mark-up media item 620-2. Edit affordance 678 allows auser to edit media item 620-2 such as by cropping the image or adjustingother characteristics of media item 620-2. As seen in FIG. 6AK, the sendaffordance 680 is displayed within the same location/region in whichcapture affordance 621 was displayed, such that send affordance 680replaces the capture affordance 621 on the display.

Send affordance 680 allows the user to immediately send media item 620-2to the recipient indicated by recipient identifier 606. For example, inFIG. 6AL, device 600 detects input 682 (e.g., a tap gesture on display601) on send affordance 680, and immediately sends media item 620-2 toJohn, the contact corresponding to recipient identifier 606, as shown inmessaging user interface 603 of FIG. 6AM. In some embodiments, device600 sends media item 620-2 without displaying messaging user interface603. In some embodiments, such as that shown in FIG. 6AM, device sendsmedia item 620-2 and displays messaging user interface 603 having mediaitem 620-2 in message display region 604, to indicate media item 620-2was sent to a participant in the message conversation (e.g., John), anddisplaying application dock 690 positioned above keyboard display region612. Application dock 690 includes application affordances 692 that canbe selected to access different applications.

In some embodiments, media item 620-2 is not immediately sent to theparticipant in the messaging conversation. For example, in FIG. 6AN,instead of selecting send affordance 680, the user selects doneaffordance 618. In response to detecting input 684 (e.g. tap gesture ondisplay 601) on done affordance 618, device 600 displays representationof media item 685 in message-compose field 608 of messaging userinterface 603, as shown in FIG. 6AO. In this embodiment (shown in FIGS.6AO-6AR), instead of displaying application dock 690, device 600displays text-suggestion region 614 along with keyboard display region612 to allow the user to compose text (e.g., 687), or optionally addother message content (e.g., multimedia content), prior to sending themessage. In response to detecting input 686 (e.g., a tap gesture ondisplay 601) on send icon 688, device 600 sends representation of mediaitem 685 to the participant in the message conversation, as shown inFIGS. 6AQ-6AR.

In some embodiments, sending media item 685 includes sending the mediaitem with encoded depth data in the media item. Sending the media itemwith depth data in the media permits the later application (e.g., laterapplication by the recipient) of depth-based effects (e.g., effectsbased on the location of objects (e.g., the user's face) in the zdirection). In some embodiments, when the media item is sent, theeffects are directly encoded in the visual (e.g., RGB) information forimproved compatibility with other devices.

In some embodiments, after sending representation of media item 685,device 600 displays messaging user interface 603 as shown in FIG. 6ARhaving text-suggestion region 614 displayed above keyboard displayregion 612. In response to receiving input 694 (e.g., a tap gesture ondisplay 601) on application dock affordance 610, device 600 replacestext-suggestion region 614 with application dock 690, as shown in FIG.6AT.

In some embodiments, application dock 690 remains displayed in messaginguser interface 603 until a user selects message compose field 608 orkeyboard region 612 (actions associated with composing text for amessage). For example, as shown in FIG. 6AU, device 600 detects input696 (e.g., tap gesture on device 601) on keyboard display region 612. Inresponse, as shown in FIG. 6AV, device 600 removes application dock 690and displays text-suggestion region 614 above keyboard display region612. In some embodiments, if input 696 is an input on message-composefield (instead of keyboard display region 612), device 600 also removesapplication dock 690 and displays text-suggestion region 614, as shownin FIG. 6AV

FIG. 6AW shows messaging user interface 603 after application dockaffordance 610 is selected (e.g., in response to input 694 in FIG. 6AS).Application dock 690 is displayed above keyboard selection region 612(e.g., replacing text-suggestion region 614). In response to detectinginput 698 (e.g., a tap gesture on display 601) on application affordance692-1, device 600 displays application display region 699 at a locationthat was previously occupied by text-suggestion region 614 and keyboarddisplay region 612. Application display region 699 optionally includesapplication dock 690, which includes an indication 6100 of the selectedapplication affordance (e.g., application affordance 692-1), as shown inFIG. 6AX.

In some embodiments, application display region 699 includes graphicalobjects 6102 that can be selected for use in messaging user interface603. In some embodiments, the type of graphical object displayed inapplication display region 699 depends on the application affordancethat was selected to invoke display of the application display region.In the embodiment illustrated in FIG. 6AX, selected applicationaffordance 692-1 corresponds to an avatar application. Accordingly,graphical objects 6102-1 and 6102-2 displayed in application displayregion 699 correspond to avatars that can be selected for use in themessaging user interface. If the application affordance selected fromapplication dock 610 corresponds to a sticker application, thengraphical objects 6102 correspond to stickers that can be selected foruse in the messaging user interface.

As previously mentioned, the foregoing embodiments described withrespect to FIGS. 6C-6AX are discussed with respect to device 600operating in the still image capture mode of operation associated withphoto affordance 619-2. However, unless specified otherwise, theseembodiments also apply to operation of device 600 when the device isoperating in the video recording mode associated with video affordance619-1. As such, the following description of FIGS. 6AY-6BQ, whichillustrate many of the foregoing embodiments as applied to the device600 when operating in the video recording mode associated with videoaffordance 619-1, is abbreviated where appropriate to avoid redundancy.

FIG. 6AY-6AZ shows device 600 switching to a video recording mode inresponse to detecting input 6104 on video affordance 619-1 in cameraapplication user interface 615. When switching to video recording mode,device 600 removes camera specific affordances 617-2, 617-3, and 617-4and displays timer 6122.

FIGS. 6AZ-6BB show device 600 enabling effects mode in response to input6106 on effects affordance 622, and then displaying visual effectsoption affordances 624 in camera options region 625 while the videorecord mode is enabled.

FIGS. 6BC-6BE show device 600 displaying avatar options menu 628 inresponse to detecting input 6108 on avatar effects affordance 624-1.Avatar options menu 628 includes avatar options 630 that can be selectedto display a corresponding avatar on the subject's face while in videomode. Robot avatar option 630-3 is selected, and device 600 displaysrobot avatar 633 on subject's face, while maintaining display of thesubject's body and background 636. Close icon 650 is selected with input6110 to close avatar options menu 628 and return to camera optionsregion 625 having highlighted effects affordance 622 and displayedvisual effects option affordances 624.

FIGS. 6BF-6BH show device 600 generating a recorded video of the imagedata represented in image display region 620. Device 600 detects input6112 on capture affordance 621, which initiates capturing the image datarepresented in image display region 620. In some embodiments, device 600generates a video recording of the image data based on a determinationthat the video recording mode of device 600 is enabled. In someembodiments, device 600 generates a video recording or optionally astill image capture based on input 6112 detected on capture affordance621. For example, if input 6112 is detected as a tap gesture, device 600captures a still image of the image data represented in image displayregion 620. If input 6112 is detected as a tap-and-hold gesture, device600 captures a video recording of the image data, as discussed ingreater detail above.

In FIG. 6BG, in response to detecting input 6112 on capture affordance621, device 600 initiates a process for recording the image datarepresented in image display region 620, which includes the subject,background, and visual effects. Device 600 also modifies cameraapplication user interface 615 by removing camera effects region 625,expanding image display region 620, and displaying stop affordance 6114and, optionally, image capture affordance 6116 for capturing stillimages of the live camera recording. In FIGS. 6BG and 6BH, image displayregion 620 represents live camera recording 620-3 of objects within thefield-of-view of camera 602, with the addition of visual effects (e.g.,robot avatar 633) that are enabled in accordance with the variousselections made as discussed above. These visual effects, which can alsoinclude stickers and full-screen effects, are displayed in image displayregion 620 as part of the live image data, and are recorded to form aportion of the resulting media item. As discussed herein, these visualeffects can be optionally displayed or hidden in the media item afterthe media item is recorded.

In FIG. 6BH, device 600 detects changes in the user's face, and modifiesrobot avatar 633 based on the detected changes in the user's face, whilerecording the live image stream. In response to detecting input 6118 onstop affordance 6114, device 600 stops recording the image datarepresented in image display region 620, and displays the playbackinterface shown in FIG. 6BI.

In FIG. 6BI, device 600 displays recorded video media item 620-4 inimage display region 620 and displays retake affordance 679 andrecipient identifier 606, which indicates a currently selected recipientof video media item 620-4, should the user subsequently send the mediaitem as discussed herein.

Device 600 also removes stop affordance 6114 and image captureaffordance 6116, and displays camera options region 625 having videoscrubber 6120 for recorded video media item 620-4, effects affordance622, edit affordance 678, markup affordance 677, and send affordance680. Camera options region 625 also includes visual effects optionaffordances 624. Visual effects option affordances 624 can be selectedto display their respective option menus, which can be used to modifycaptured media item 620-4. FIGS. 6BJ-6BN illustrate an example ofchanging visual effects in the captured video media item 620-4. As seenin FIG. 6BI, the send affordance 680 is displayed within the samelocation/region in which capture affordance 6116 was displayed, suchthat send affordance 680 replaces the capture affordance 6116 on thedisplay.

FIGS. 6BJ-6BN show device 600 displaying avatar options menu 628 inresponse to detecting input 6124 on avatar effects affordance 624-1. Asseen in FIG. 6BK, avatar options menu 628 shows avatar options 630,including selected robot avatar option 630-3. As seen in FIG. 6BL,device 600 detects input 6126 to select rabbit avatar option 630-4, andupdates video media item 620-4 to show rabbit avatar 634 positioned onthe subject's head, while still displaying the subject's body andbackground 636, as seen in FIG. 6BM. As seen in FIG. 6BN, in response todetecting input 6128 on close icon 650, device 600 closes avatar optionsmenu 628 and displays camera options region 625 of FIG. 6BO. Cameraoptions region 625, including video scrubber 6120, updates fromdisplaying robot avatar 633 in FIG. 6BJ, to showing rabbit avatar 634.In some embodiments, in response to an input directed to the videoscrubber 6120 (e.g., a swipe or drag input including movement of acontact that starts at a location of video scrubber 6120), the devicescrubs through recorded video (e.g., changing the appearance of thepreview of the video and moving the scrubber or moving a playhead in thescrubber in accordance with the input). In some embodiments, scrubbingthrough the recorded video shows how the effects applied to the videochange over time (e.g., how stickers and/or avatars move and/or changeexpression as the user's head/face moves and or changes expression inthe field of view of the one or more cameras). In some embodiments,video scrubber 6120 includes selectable affordances for cropping thevideo (e.g., by dragging editing affordances at the ends of the videoscrubber to change a starting or ending time for the video scrubber).For example, in response to a user input directed to video scrubber 8120in FIG. 8AW, the device would scrub through the video with an avatar andstickers as shown in FIGS. 8AW-8AZ, but at a rate determined based onthe change in the input (e.g., the magnitude and direction of themovement of the contact on the touch-sensitive display). The interactionwith video scrubber 6120 described above (and video scrubber 8120described below) optionally applies in a similar manner to interactionwith other scrubbers described herein.

As discussed above with respect to FIGS. 6AK-6AO, video media item 620-4can be immediately sent to a participant in the message conversation inresponse to detecting an input (e.g., input 6130) on send affordance680, as shown in FIGS. 6BP and 6BQ. A representation of video media item620-4 can be displayed in the message-compose field 608 in response toan input on done affordance 618.

FIGS. 7A and 7B are a flow diagram illustrating a method for displayingvisual effects in a messaging application using an electronic device inaccordance with some embodiments. Method 700 is performed at a device(e.g., 100, 300, 500, 600) with a camera, a display apparatus, and oneor more input devices. Some operations in method 700 are, optionally,combined, the orders of some operations are, optionally, changed, andsome operations are, optionally, omitted.

As described below, method 700 provides an intuitive way for displayingvisual effects in a messaging application. The method reduces thecognitive burden on a user for applying visual effects to an image forsending in a messaging application, thereby creating a more efficienthuman-machine interface. For battery-operated computing devices,enabling a user to display visual effects in an image faster and moreefficiently conserves power and increases the time between batterycharges.

The electronic device (e.g., 600) displays (702), via the displayapparatus (e.g., 601), a messaging user interface (e.g., 603) of amessage conversation including at least a first participant, themessaging user interface including a camera affordance (e.g., 609, aselectable icon associated with a function for activating a cameraapplication).

The electronic device (e.g., 600) detects (704), via the one or moreinput devices, a first input (e.g., 616, a touch gesture on a touchscreen display at a location that corresponds to the camera affordance)directed to the camera affordance.

In response to detecting the first input (e.g., 616), the electronicdevice (e.g., 600) displays (706) a camera user interface (e.g., 615).The camera user interface includes (708) a capture affordance (e.g.,621, a selectable icon associated with a function for capturing imagedata using the camera of the electronic device).

In some embodiments, the camera user interface (e.g., 615) includes(710) an effects mode affordance (e.g., 622, a selectable iconassociated with a function for activating a mode in which various visualeffects are available for modifying image data) associated with a modein which visual effects are enabled for display in the captured imagedata. Including an effects mode affordance in the camera user interfaceenables the user to recognize that certain effects (e.g., visualeffects) can be applied to an image via the camera user interface.Providing additional control of the device enhances the operability ofthe device and makes the user-device interface more efficient (e.g., byhelping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, thevisual effects can be added to a representation of image data within afield-of-view of the camera. In some embodiments, the visual effects canbe added to captured image data. In some embodiments, the visual effectsare based on depth data. In some embodiments, the electronic devicedetects, via the one or more input devices, a selection (e.g., 623) ofthe effects mode affordance. In some embodiments, in response todetecting the selection of the effects mode affordance, the electronicdevice transitions the electronic device from a first camera mode (e.g.,standard camera mode) to a second camera mode, different from the firstcamera mode (e.g., an effects camera mode; a mode in which variousvisual effects are available for modifying image data). In someembodiments, while the device is in the second camera mode, a visualindication that the second camera mode is operative is displayed (e.g.,the effects mode affordance is highlighted).

In some embodiments, further in response to detecting selection (e.g.,623) of the effects mode affordance (e.g., 622), the electronic device(e.g., 600) ceases to display the one or more camera mode affordances.In some embodiments, further in response to detecting selection of theeffects mode affordance, the electronic device displays a plurality ofeffects option affordances (e.g., 624, selectable icons each associatedwith a function for creating a visual effect). In some embodiments, theeffects option affordances include a sticker affordance (e.g., 624-2)and/or an avatar affordance (e.g., 624-1) at a location in the camerauser interface (e.g., 615) that was previously occupied by the one ormore camera mode affordances. In some embodiments, the locations atwhich the effects option affordances are displayed are any locations ina particular region (e.g., camera effects region 625) in which thecamera mode affordances were previously displayed. In some embodiments,the locations at which respective effects option affordances aredisplayed are the same locations that respective camera mode affordanceswere displayed in the region. In some embodiments, displaying theeffects option affordances includes replacing the camera modeaffordances with the effects option affordances).

In some embodiments, the electronic device (e.g., 600) detects, via theone or more input devices, selection (e.g., 654) of a first one of theeffects option affordances (e.g., stickers affordance 624-2). In someembodiments, in response to detecting selection of the first one of theeffects option affordances, the electronic device ceases to display theplurality of effects option affordances (e.g., 624) and displays aplurality of selectable graphical icons (e.g., 658, stickers). Ceasingto display the plurality of effects option affordances and displayingthe plurality of selectable graphical icons in response to detectingselection of the first one of the effects option affordances enables theuser to quickly and easily recognize that the first one of the effectsoption affordances relates to graphical icon (e.g., sticker) options,thereby enhancing the operability of the device and making theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments,displaying the stickers includes displaying a region (e.g., sticker menu656) over the effects option affordances, where the region includes aplurality of sticker options that can be selected for display on animage represented in the camera user interface. In some embodiments, auser selects (e.g., 660) a sticker (e.g., 658-1) by tapping on thesticker, and the sticker is automatically displayed on the image (e.g.,at a default location such as the center of the image). In someembodiments, a user selects a sticker by touching the sticker anddragging it from the sticker menu onto the image. In some embodiments,while displaying the capture affordance and further in response todetecting selection of the first one of the effects option affordances,the electronic device ceases to display the capture affordance.

In some embodiments, the electronic device (e.g., 600) detects, via theone or more input devices, selection (e.g., 626) of a second one of theeffects option affordances (e.g., avatar affordance 624-1). In someembodiments, in response to detecting selection of the second one of theeffects option affordances, the electronic device ceases to display theplurality of effects option affordances (e.g., 624) and displays anavatar selection region (e.g., avatar menu 628) having a plurality ofavatar affordances (e.g., 630, displayed in a linear arrangement) (e.g.,affordances that represent avatars). Ceasing to display the plurality ofeffects option affordances and displaying an avatar selection region inresponse to detecting selection of the second one of the effects optionaffordances enables the user to quickly and easily recognize that thesecond of the effects option affordances relates to avatar selection,thereby enhancing the operability of the device and making theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, theavatar affordances correspond to avatars that are customizable,non-customizable, or a combination thereof. In some embodiments,displaying the avatar selection region includes displaying the region(e.g., avatar menu 628) over the effects option affordances, the regionincluding a plurality of avatar affordances that can be selected todisplay a corresponding avatar on an image represented in the camerauser interface.

In some embodiments, the electronic device (e.g., 600) detects, via theone or more input devices, a swipe input (e.g., 646, a vertical swipegesture) on the avatar selection region (e.g., 628). In someembodiments, in response to detecting the swipe input on the avatarselection region, the electronic device increases a size of the avatarselection region (e.g., 628-1) and displays the plurality of avataraffordances (e.g., 630, arranged in a matrix. Increasing a size of theavatar selection region and displaying the plurality of avataraffordances arranged in a matrix in response to detecting the swipeinput on the avatar selection region enables the user to (concurrently)view one or more additional selectable avatars that were not(concurrently) visible in the avatar selection region. Providingadditional control options without cluttering the UI with additionaldisplayed controls enhances the operability of the device and makes theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments,increasing the size of the avatar display region includes extending theavatar display region in a vertical direction to present a full-screendisplay of the avatar display region, with the avatar affordancesdisplayed in a matrix in the avatar display region.

In some embodiments, the camera user interface (e.g., 615) furtherincludes a first representation of image data (e.g., a live camerapreview 620-1). Providing the first representation of image data (e.g.,a live camera preview) provides visual feedback about one or moremodifications (to an image) made by the user prior to saving/confirmingthe modifications. Providing improved visual feedback to the userenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, further in response to detectingselection of the effects mode affordance (e.g., 624-1, 624-2), theelectronic device (e.g., 600), in accordance with a determination thatthe first representation of image data corresponds to image dataobtained from a second camera (e.g., a rear-facing camera), ceases todisplay the first representation of image data and displays a secondrepresentation of image data (e.g., a live camera preview), the secondrepresentation of image data corresponding to image data obtained fromthe camera (e.g., a front-facing camera). In some embodiments, arepresentation of image data corresponding to the front-facing cameraincludes a representation of a user positioned in the field-of-view ofthe front-facing camera.

In some embodiments, while the electronic device (e.g., 600) is in thesecond camera mode (e.g., an effects camera mode; a mode in whichvarious visual effects are available for modifying image data), theelectronic device receives a request to transition (e.g., a selection ofan active visual effects affordance) to the first camera mode (e.g., thenormal mode, a non-effects mode). In some embodiments, in response toreceiving the request to transition to the first camera mode, theelectronic device transitions the electronic device from the secondcamera mode to the first camera mode. In some embodiments, in accordancewith a first visual effect being active (e.g., actively applied tocaptured image data or a preview of image data for capture), theelectronic device deactivates (e.g., disabling, ceasing to display thedisplayed first visual effect) the first visual effect.

In some embodiments, after deactivating the first visual effect, theelectronic device (e.g., 600) detects, via the one or more inputdevices, subsequent selection of the effects mode affordance (e.g.,624-1, 624-2). In some embodiments, in response to detecting thesubsequent selection of the effects mode affordance, the electronicdevice, in accordance with a determination that the subsequent selectionof the effects mode affordance occurs within a predetermined amount oftime after deactivating the first visual effect, re-activates the firstvisual effect. Re-activating the first visual effect in accordance withthe determination that the subsequent selection of the effects modeaffordance occurs within a predetermined amount of time afterdeactivating the first visual effect enables a user to quickly andeasily revert back to a previous visual effect (e.g., without having tore-select/re-create the effect). Providing additional control optionswithout cluttering the UI with additional displayed controls enhancesthe operability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently. In some embodiments, after selecting the effects modeaffordance to remove the visual effects from the image, if the effectsmode affordance is selected again within a predetermined time period,the removed visual effects are restored to the image.

The electronic device (e.g., 600) detects (712), via the one or moreinput devices, a second input (e.g., 676, a touch gesture on a touchscreen display at a location that corresponds to the capture affordance)directed to the capture affordance (e.g., 621).

In response to detecting the second input (714), the electronic device(e.g., 600) captures (716) image data using the camera (e.g., 602). Insome embodiments, capturing the image data includes, in accordance witha value of a characteristic (e.g., a duration of contact) of the secondinput meeting a first capture mode criteria (e.g., less than a thresholdduration), capturing the image data in a first image capture mode (e.g.,a photo capture mode, a still image capture mode). In some embodiments,capturing the image data includes, in accordance with the value of thecharacteristic of the second input meeting a second capture modecriteria (e.g., greater than a second threshold duration), capturing theimage data in a second image capture mode (e.g., a video capture mode, acontinuous capture mode). In some embodiments, the capture affordance(e.g., 621) is a multi-function capture affordance. In some embodiments,the electronic device captures a photo (e.g., a still image) when a tapis detected on the capture affordance. In some embodiments, theelectronic device captures a video (e.g., a continuous image) when apress-and-hold gesture is detected on the capture affordance.

In response to detecting the second input (e.g., 676) (714), theelectronic device (e.g., 600) ceases (718) to display the captureaffordance (e.g., 621).

In some embodiments, after capturing image data using the camera (e.g.,602), the electronic device (e.g., 600) displays a mark-up affordance(e.g., 677), an edit affordance (e.g., 678), and a retake affordance(e.g., 679). In some embodiments, while displaying the mark-upaffordance, the edit affordance, and the retake affordance, theelectronic device receives, via the one or more input devices, a fourthuser input. In some embodiments, in response to detecting the fourthuser input, in accordance with the fourth user input corresponding tothe edit affordance, the electronic device initiates a process forediting the captured image data. In some embodiments, the process forediting includes displaying one or more affordances for editing thecaptured image data. In some embodiments, in response to detecting thefourth user input, in accordance with the fourth user inputcorresponding to the mark-up affordance, the electronic device initiatesa process for marking-up the captured image data. In some embodiments,the process for editing includes displaying one or more affordances formarking up the captured image data. In some embodiments, in response todetecting the fourth user input, in accordance with the fourth userinput corresponding to the retake affordance, the electronic deviceinitiates a process for retaking the captured image data. In someembodiments, initiating the process for retaking the captured image dataincludes capturing new image data and replacing the captured image datawith the new image data.

In response to detecting the second input (e.g., 676) (714), theelectronic device (e.g., 600) displays (720) a send affordance (e.g.,680, a selectable icon associated with a function for sending capturedimage data to a participant in a conversation, or for presenting thecaptured image data in a compose region prior to subsequent sending) ata location in the camera user interface (e.g., 615) that was previouslyoccupied by the capture affordance (e.g., 621). Displaying the sendaffordance at the location in the camera user interface that waspreviously occupied by the capture affordance provides visual feedbackthat the captured image is ready to be transmitted to an intendedrecipient. Providing visual feedback to the user without cluttering theUI enhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, the location at which the sendaffordance (e.g., 680) is displayed is any location in a particularregion (e.g., camera effects region 625) in which the capture affordancewas previously displayed. In some embodiments, the location at which thesend affordance is displayed is the same location that the captureaffordance was displayed in the region. In some embodiments, displayingthe send affordance includes replacing the capture affordance with thesend affordance.

In some embodiments, while displaying the send affordance (e.g., 680),the electronic device (e.g., 600) displays (722) a representation of thefirst participant (e.g., an icon, picture, avatar, or other identifierassociated with the first participant). Displaying the representation ofthe first participant while displaying the send affordance enables theuser to quickly and easily recognize the intended recipient, therebyenhancing the operability of the device and making the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, the representation of the firstparticipant serves as an indication to the user that the captured photowill be sent to the first participant). In some embodiments, therepresentation of the first participant is not displayed (724) prior tocapturing the image data using the camera (e.g., 602). In someembodiments, the camera user interface (e.g., 615) includescamera-specific affordances (e.g., 619, corresponding to filters,lighting options, timer options, etc.) that are displayed prior tocapturing the image data. In some embodiments, displaying therepresentation of the first participant replaces the displayedcamera-specific affordances with the representation of the firstparticipant.

The electronic device (e.g., 600) detects (726), via the one or moreinput devices, a third input (e.g., a touch gesture on a touch screendisplay at a location that corresponds to the send affordance 680)directed to the send affordance.

In response to detecting the third input, the electronic device (e.g.,600) initiates (728) a process (e.g., immediately sending or presentingthe captured image data in a compose region prior to subsequent sending)to send the captured image data to the first participant.

In some embodiments, prior to detecting the third input directed to thesend affordance (e.g., 680), the electronic device (e.g., 600) displaysa done affordance (e.g., a selectable icon associated with a functionfor closing the camera user interface to display the messaging userinterface). In some embodiments, the electronic device detects, via theone or more input devices, selection of the done affordance. In someembodiments, in response to detecting selection of the done affordance,the electronic device displays the messaging user interface (e.g., 603),the messaging user interface having a message-compose region (e.g.,608). In some embodiments, in response to detecting selection of thedone affordance, the electronic device displays a representation of thecaptured image data in the message-compose region. In some embodiments,selecting the done affordance closes the camera user interface anddisplays the captured image data in a message-compose field of themessaging user interface, without sending the captured image data.

In some embodiments, prior to detecting the third user input, theelectronic device (e.g., 600) is in a first image capture mode (e.g.,photo capture mode, video capture mode). In some embodiments, further inresponse to detecting the third user input, the electronic devicemaintains the first image capture mode. In some embodiments, theelectronic device can be configured (e.g., user configured) to captureimage data according to a plurality of modes (e.g., photo capture mode,video capture mode). In some embodiments, a selection of a image capturemode is persistent, even when the electronic transitions from a firstcamera mode (e.g., a standard camera mode, a non-effects camera mode) toa second camera mode (e.g., an effects camera mode).

In some embodiments, initiating the process to send the captured imagedata to the first participant includes sending the captured image datato the first participant (e.g., without displaying the messaging userinterface 615). In some embodiments, selecting the send affordance(e.g., 680) from the camera user interface immediately sends thecaptured image data to another participant in a message conversationwithout displaying any intermediate user interface or requiring furtherinput from the user.

In some embodiments, initiating the process to send the captured imagedata to the first participant includes re-displaying the messaging userinterface (e.g., 615), where the messaging user interface furtherincludes a keyboard region (e.g., 612) and an application menuaffordance (e.g., 610, a selectable icon associated with a function fordisplaying an application menu user interface). Re-displaying themessaging user interface as part of initiating the process to send thecaptured image data to the first participant provides visual feedbackthat the captured image data is being sent via the message conversation.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, theapplication menu affordance is displayed adjacent a message-composefield in the messaging user interface. In some embodiments, theelectronic device (e.g., 600) detects, via the one or more inputdevices, selection of the application menu affordance. In someembodiments, in response to detecting selection of the application menuaffordance, the electronic device displays an application menu regionadjacent (e.g., above) the keyboard region, where the application menuregion has a plurality of application affordances (e.g., selectableicons each associated with a function for initiating an applicationassociated with the respective application affordance). In someembodiments, the application affordances include stickers affordancesand avatar affordances.

In some embodiments, the electronic device (e.g., 600) detects a fourthinput on the messaging user interface (e.g., 615). In some embodiments,in accordance with a determination the fourth input corresponds to alocation of the keyboard region (e.g., 612) or a location of amessage-compose region (e.g., 608) in the messaging user interface, theelectronic device ceases to display the application menu region anddisplays a text-suggestion region (e.g., a region having a listing ofsuggested words for convenient selection by a user) at a location in themessaging user interface that was previously occupied by the applicationmenu region.

In some embodiments, the electronic device (e.g., 600) detects selectionof one of the plurality of application affordances (e.g., a stickersaffordance or an avatar affordance) in the application menu region. Insome embodiments, in response to detecting selection of the applicationaffordance (e.g., 610), the electronic device ceases to display thekeyboard region (e.g., 612) and displays an application display regionat a location in the messaging user interface (e.g., 615) that waspreviously occupied by the keyboard region, where the applicationdisplay region includes a plurality of graphical objects (e.g., avatarsor stickers) corresponding to the selected application affordance. Insome embodiments, the selected application affordance is a stickersaffordance and the graphical objects displayed in the applicationdisplay region are stickers. In some embodiments, the selectedapplication affordance is an avatar affordance and the graphical objectsdisplayed in the application display region are avatars (e.g.,customizable avatars and/or non-customizable avatars).

Note that details of the processes described above with respect tomethod 700 (e.g., FIGS. 7A-7B) are also applicable in an analogousmanner to the methods described below. For example, methods 900, 1100,1300, and 1500 optionally includes one or more of the characteristics ofthe various methods described above with reference to method 700. Forexample, visual effects such as stickers and virtual avatars aredisplayed in image data in a camera application user interface, in amedia user interface, and in a user interface for live videocommunication sessions. For brevity, these details are not repeatedbelow.

FIGS. 8A-8BQ illustrate exemplary user interfaces for displaying visualeffects in a camera application, in accordance with some embodiments.The user interfaces in these figures are used to illustrate theprocesses described below, including the processes in FIGS. 9A and 9B.

In FIG. 8A, device 600 shows home screen 800 and detects input 801 oncamera application affordance 802.

In FIG. 8B, in response to detecting input 801, device 600 launches acamera application associated with camera application affordance 802 anddisplays camera application user interface 815. Camera application userinterface 815 is similar to camera application user interface 615discussed above with respect to the embodiments shown in FIGS. 6A-6BQ.

Camera application user interface 815 includes image display region 820which displays a representation of image data such as, for example,streamed image data (e.g., a live camera preview, live camera recording,or live video communications session) representing objects positionedwithin a field-of-view of a camera (e.g., a rear-facing camera or camera602), or a media item such as, for example, a photograph or a videorecording. In the embodiment illustrated in FIG. 8B, image displayregion 820 shows live camera preview 820-1′ from a rear-facing camera ofdevice 600.

Camera application user interface 815 also includes a region above imagedisplay region 820 that includes camera-specific affordances 817.Camera-specific affordances include affordance 817-1 associated with acamera flash function, affordance 817-2 associated with a camera modefunction, affordance 817-3 associated with a timer function, andaffordance 817-4 associated with a filter function.

Camera application user interface 815 also includes camera optionsregion 825 (similar to camera options region 625) positioned below imagedisplay region 820 (similar to image display region 620). Camera optionsregion 825 includes camera selector affordance 827 for switching betweencameras (e.g., a rear-facing camera and camera 602), and camera optionaffordances 819 associated with different capture modes in which acamera can record image data. For example, video affordance 819-1 isassociated with a function for activating a video recording capture modeof the camera, and photo affordance 819-2 is associated with a functionfor activating a still image capture mode of the camera. In theembodiments discussed below with respect to FIGS. 8B-8AQ, device 600 isin the still image capture mode of operation associated with photoaffordance 819-2. However, unless specified otherwise, these embodimentsalso apply to the video recording mode associated with video affordance819-1.

Camera options region 825 further includes effects affordance 822 forenabling and disabling a mode (visual effects mode, effects mode) ofdevice 600 in which device 600 is enabled or disabled for displayingvisual effects in image display region 820. Effects affordance 822 issimilar to effects affordance 622 and, therefore, has the samefunctionality as effects affordance 622, unless specified otherwise.Accordingly, effects affordance 822 can be selected to enable display ofvisual effects, and deselected to disable display of visual effects.

Camera options region 825 also includes capture affordance 821, whichfunctions in a manner similar to capture affordance 621 discussed above.Capture affordance 821 can be selected to capture image data representedin image display region 820. In some embodiments, device 600 capturesthe image data in a manner based on the currently enabled capture option(e.g., video recording capture mode or image capture mode). In someembodiments, device 600 captures the image data depending on the type ofgesture detected on capture affordance 821. For example, if device 600detects a tap gesture on capture affordance 821, device 600 captures astill image of the image data represented in image display region 820 atthe time the tap gesture occurs. If device 600 detects a tap-and-holdgesture on capture affordance 821, device 600 captures a video recordingof the image data represented in image display region 820 during aperiod of time for which the tap-and-hold gesture persists. In someembodiments, the video recording stops when the finger lifts off of theaffordance. In some embodiments, the video recording continues until asubsequent input (e.g., a tap input) is detected at a locationcorresponding to the affordance. In some embodiments, the captured image(e.g., still image or video recording) can be shared with other devices,for example, using a messaging application.

In FIG. 8C, device 600 detects input 823 on effects affordance 822.

In FIG. 8D, in response to detecting input 823, device 600 activatescamera 602 (e.g., switches from the rear-facing camera) and updatesimage display region 820 to display live camera preview 820-1 fromcamera 602, showing a representation of subject 832 positioned in thefield of view of camera 602 and background 836 displayed behind subject832. As discussed herein, image data captured using camera 602 includes,in some embodiments, depth data that can be used to determine a depth ofobjects in the field of view of camera 602. In some embodiments, device600 parses objects (e.g., in image data) based on a detected depth ofthose objects, and uses this determination to apply the visual effectsdiscussed herein. For example, device 600 can categorize subject 832 asbeing in the foreground of the live camera preview 820-1 and objectspositioned behind the user as being in the background of the live camerapreview 820-1. These background objects are referred to generally hereinas background 836.

Device 600 also highlights effects affordance 822 to indicate visualeffects are enabled for display, and updates camera options region 825by replacing camera option affordances 819 with visual effects optionaffordances 824. The visual effects option affordances include avatareffects affordance 824-1 and sticker effects affordance 824-2. Visualeffects option affordances 824 are similar to visual effects optionaffordances 624 described above. Visual effects option affordances 824correspond to different visual effects that can be applied to the imagedisplayed in image display region 820. By selecting one of the visualeffect option affordances (e.g., 824-1 or 824-2) a menu is displayedwith visual effects options corresponding to the selected visual effectsoption affordance.

In FIGS. 8E-8F, device 600 detects input 826 on avatar effectsaffordance 824-1, and displays avatar options menu 828. Avatar optionsmenu 828 is similar to avatar options menu 628. Avatar options menu 828includes a scrollable listing of avatar options 830 and selection region829 for indicating a selected one of avatar options 830. Displayedavatar options include customizable woman avatar 830-1, null avataroption 830-2, robot avatar option 830-3, rabbit avatar option 830-4, anda second customizable woman avatar 830-5. As shown in FIG. 8F, nullavatar option 830-2 is selected as the default avatar option 830. Thus,the representation of subject 832 displayed in image display region 820is shown with no avatar displayed on the subject's face. Avatar options830 can be preconfigured, like avatar options 830-3 or 830-4, or theycan be customizable avatar, such as avatar options 830-1 and 830-5.

Avatar options 830 have a similar functionality to avatar options 630.Thus, avatar options 830 correspond to a virtual avatar visual effectapplied to a representation of the subject in image display region 820.Specifically, each avatar option 830 corresponds to a virtual avatarthat, when selected, is transposed onto the face of the subject in theimage display region, while other portions of the image in image displayregion (such as a background or other portions of the user, such astheir body) remain displayed. A user (e.g., subject 832) positioned inthe field-of-view of camera 602 can control visual aspects of thevirtual avatar by changing the pose (e.g., rotation or orientation) oftheir face as discussed above.

Avatar options 830 can be scrolled by gestures on the avatar optionsmenu 828. For example, a horizontal gesture 844 a is shown in FIG. 8G toselect robot avatar option 830-3, which is then displayed as robotavatar 833 on the subject's face in image display region 820, as shownin FIG. 8H. Continued scrolling via horizontal gestures 844 b, scrollsadditional avatar options 830 onto the displayed avatar options menu828. In some embodiments, as the user scrolls the avatar options 830,image display region 820 shows an avatar (e.g., robot avatar 833 in FIG.8J) with blurred effect 803 applied to the represented image data behindthe avatar.

In FIG. 8K, device 600 displays customizable woman avatar option 830-5positioned within selection region 829, just prior to being selected(e.g., gesture 844 b has not yet terminated), and edit affordance 804displayed below customizable woman avatar option 830-5. Device 600continues to display robot avatar 833 with blurred effect 803. In someembodiments, device 600 displays edit affordance 804 in selection region829 when a customizable avatar option is positioned in selection region829. Edit affordance 804 can be selected to display an avatar editinguser interface, such as that discussed below with respect to FIG. 8AA.

In FIG. 8L, after avatar option 830-5 is selected (e.g., gesture 844 bis terminated). Device 600 removes robot avatar 833 from image displayregion 820 (while continuing the blurred effect 803) and displays ananimation of customizable woman avatar 835 appearing from the avataroption 830-5 within selection region 829 and moving to image displayregion 820 (e.g., at a position at the center of image display region820, or at a location corresponding to the subject's face), where avatar835 is modified based on detected changes to the subject's face. In someembodiments, the animation includes the avatar (e.g., 835) starting outas a static appearance (e.g., similar to a sticker matching thedisplayed avatar option) and then transitioning into a dynamic state(shown in FIG. 8L) in which the avatar (e.g., 835) is modified based ondevice 600 detecting changes to a user's face. In some embodiments, theavatar's transition from static to dynamic occurs prior to the avatarmoving from selection region. In some embodiments, the avatar'stransition from static to dynamic occurs as the avatar is moving fromselection region 829 to image display region 820.

As shown in FIGS. 8M and 8N, when avatar 835 is applied to the subject'sface, device 600 modifies avatar 835 based on detected changes tosubject's face, including changes to the position of the face within thefield of view of camera 602.

In some embodiments, when device 600 no longer detects the subject'sface within the field-of-view of camera 602, device 600 again applies ablurring effect 803 (similar to the blurring effect 644) to thebackground and displays prompt 838 instructing the user to return theirface to the field-of-view of camera 602. In the embodiment shown in FIG.8P, device 600 displays prompt 838 containing representation 840 of theselected avatar (e.g., customizable woman avatar) and message 842instructing the user to reposition their head in the field-of-view ofcamera 602. In some embodiments, displaying prompt 838 includesdisplaying an animation of the selected avatar (e.g., 835) returning toa center location of image display region 820, and showing a slowedmovement of the avatar and its features as they appear to settle to astop based on a physics model of the avatar features.

In some embodiments, when device 600 detects the user's face returningto the field-of-view of camera 602, device 600 displays avatar 835moving from the center position of image display region 820 to theposition of the user's face, and resumes modifying the avatar based ondetected changes to the user's face.

In some embodiments, avatar options menu 828 can be expanded with avertical gesture (e.g., 805) to display an enlarged version of theavatar options menu as shown in FIGS. 8Q-8T. The enlarged version of theavatar options menu displays a greater number of avatar options 830,including avatars of different types (e.g., customizable andnon-customizable). A new avatar option (e.g., robot avatar option 830-3)may be selected in the enlarged avatar display menu. Device 600 thendisplays the corresponding avatar (e.g., robot avatar 833) on the user'sface when the enlarged version of the avatar options menu is returned toits original (e.g., condensed) state, as shown in FIG. 8U.

In some embodiments, device 600 displays different avatars on the user'shead in response to detecting swipe gestures on image display region820. For example, in FIG. 8V, device 600 detects swipe gesture 806moving in a rightward direction across image display region 820. Inresponse to detecting swipe gesture 806, device 600 scrolls avataroptions 830 in avatar options menu 828. In some embodiments, themagnitude of the scroll is determined by the magnitude of the swipegesture. Thus, a user can scroll through multiple avatars based on themagnitude of the swipe gesture. In addition, the direction of scroll,and the corresponding movement of avatars on image display region 820,is determined based on a direction of swipe gesture 806. Therefore, thescrolling of avatar options and corresponding movement of transitioningavatars can be in a leftward direction if the swipe is in a leftwarddirection, or can be in a rightward direction if the swipe is in arightward direction (as shown in FIGS. 8V and 8W).

As the avatar options 830 begin to scroll, the currently selected avataroption (e.g. robot avatar option 830-3 in FIG. 8V) leaves selectionregion 829 and, as the scroll continues, a new avatar option (e.g.,avatar option 830-1 in FIG. 8W) enters selection region 829. As theavatar options scroll, device 600 also displays, in image display region820, an animation of the currently selected avatar (e.g., robot avatar833) moving off of the user's face and exiting the right side of imagedisplay region 820 while a next avatar (e.g., customizable avatar 831)enters image display region 820 from the left, and moves to the user'sface. In some embodiments, device 600 applies blurring effect 803 to theimage display region 820 while the avatars are scrolled across imagedisplay region 820.

In the embodiment illustrated in FIGS. 8V and 8W, null avatar option830-2 is not selected to better illustrate a transition from a currentlyselected avatar to a new avatar that appears on image display region820. However, it should be understood that null avatar option 830-2 canbe selected with a swipe gesture on image display region 820. In suchinstances, the currently selected avatar is moved off screen while thebackground is blurred, and then the blurring effect is removed todisplay the user's face with no avatar.

FIGS. 8X and 8Y show a selection of a different customizable avataroption (e.g., customizable avatar option 830-5) and the correspondingdisplayed customizable avatar 835. When customizable avatar option 830-5is displayed in selection region 829, edit affordance 804 is displayed.Edit affordance 804 can be selected (e.g., via input 807) to displayavatar editing user interface 808 shown in FIG. 8AA. Avatar editing userinterface 808 provides an interface for editing the avatar (e.g.customizable avatar 835) corresponding to the customizable avatar option(e.g., 830-5) associated with the selected edit affordance 804.

In some embodiments, applied visual effects can include a lightingeffect such as shadow 851 shown on the subject's neck below appliedcustom avatar 835 or light reflections on glasses. As device 600modifies avatar 835 to mirror the real-time movements of the user,device 600 also modifies the lighting effects on avatar 835 and thoseprojected onto the subject, including moving displayed locations ofreflections shadows based on a relative position of a modeled lightsource and avatar 835.

As shown in FIG. 8AA, avatar editing user interface 808 includes arepresentation of customizable avatar 835 and various avatar featureoptions 810 (e.g., face color options 810-1 and face shape options810-2) that represent currently selected avatar feature options andavailable feature options that can be selected to modify avatar 835. Theavatar feature options correspond to values for aspects of a particularavatar feature, specifically a currently selected avatar feature, suchas an avatar head feature indicated by highlighted avatar headaffordance 809-1. Avatar editing user interface 808 indicates a selectedface color option 810-1 a and a selected face shape option 810-2 a,which are represented in avatar 835. The displayed avatar featureoptions can be changed by selecting a different avatar feature. Forexample, in FIG. 8 AB, device 600 updates avatar editing user interface808 to display different avatar hair feature options (e.g., hair textureoptions 811-1 and hair color options 811-2) when the avatar hairaffordance 809-2 is selected.

Device 600 modifies avatar 835 when different avatar feature options areselected. For example, in FIG. 8AB, device 600 detects selection ofstraight hair texture option 811-2 a, and updates avatar 835 to have theselected straight hair texture option. In some embodiments, device 600modifies the representation of avatar 835 shown in avatar editing userinterface 808 in response to detected changes in a user's face.

In response to detecting selection of done affordance 812, device 600exits avatar editing user interface 808 and returns to cameraapplication user interface 815 showing the selected avatar option 830-5and corresponding avatar 835 updated based on the hair texture optionselected in the avatar editing user interface.

FIGS. 8AD-8AF show an embodiment in which a new customized avatar can becreated and added to avatar options menu 828. In response to detecting aselection of new avatar affordance 813, device 600 displays avatarediting user interface 808 having a representation of a default avatarwith no selected avatar feature options, as shown in FIG. 8AE. Device600 can modify the default avatar based on user selection of the avatarfeature options and save the avatar as a new avatar that is selectablefrom the avatar options menu 828. In FIG. 8AF, device 600 detectsselection of a cancel affordance and, therefore, foregoes saving themodified avatar in avatar options menu 828, as shown in FIG. 8AG.

In response to detecting selection of cancel icon 850 in FIG. 8AG,device 600 closes avatar options menu 828 and displays camera optionsregion 825 as shown in FIG. 8AH.

In response to detecting selection of sticker effects affordance 824-2,device 600 displays sticker options menu 856 having a scrollable listingof sticker 858 in FIG. 8AI. Sticker options menu 856 and stickers 858are similar to sticker options menu 656 and stickers 658 discussedabove. The stickers are static graphical objects that may be selected bya user and applied to the image in image display region 820. In someembodiments a sticker can be selected by a tap gesture, and thecorresponding sticker is then displayed at a position on the imagedisplay region 820. In some embodiments, a sticker can be selected by atouch-and-drag gesture that initiates on the selected sticker, drags thesticker to a desired location, and then places the sticker at thedesired location by terminating the drag (e.g., lifting finger). Anexample of such an embodiment is illustrated in FIGS. 8AI-8AK, wherehelmet sticker 858-1 is selected (e.g., input 853), dragged to aposition above (e.g., away from) custom avatar 835. In FIG. 8AK, device600 detects subject 832 move laterally from the position shown in FIG.8AJ. In response, device 600 displays helmet sticker 858-1 and avatar835 both moving laterally based on the lateral movement (e.g., directionand magnitude) of the representation of the subject.

In FIG. 8AK, device 600 detects input 854 on close affordance 855. Inresponse, device 600 closes sticker options menu 856 to display cameraoptions region 825 in FIG. 8AL. In FIG. 8AL, device 600 shows thatsubject 832 is shifted back to the center of the screen with their headtilted. In response to detecting this movement of subject 832, device600 displays helmet sticker 858-1 above customizable avatar 835 in FIG.8AL and shifted laterally (from the position shown in FIG. 8AK) based onthe movement of the subject's head to the center of the screen. Device600 displays helmet sticker 858-1 maintaining the relative spacing withrespect to avatar 835. However, helmet sticker 858-1 is not rotated(e.g., tilted) in response to the rotation (e.g., tilting) of thesubject's head, whereas avatar 835 is rotated (e.g., tilted), as shownin FIG. 8AL.

In FIG. 8AL, device 600 detects a selection of capture affordance 821,which causes device 600 to capture an image of the live camera preview820-1 displayed in image display region 820, including the visualeffects (e.g., customized avatar 835 and helmet sticker 858-1), as wellas other image data including subject 832 and background 836.

In FIG. 8AM, device 600 displays camera application user interface 815showing the captured image (e.g., media item 820-2) in the image displayregion 820 (e.g., media item 820-2 replaces the live camera preview820-1 shown in image display region 820). Media item 820-2 is arepresentation of the live camera preview 820-1 at the time the captureaffordance 821 was selected. Thus, media item 820-2 includes helmetsticker 858-1 and avatar 835 displayed over the face of subject 832, andbackground 836.

Device 600 also replaces the camera-specific affordances (e.g.,affordances 817 shown in FIG. 8AL) with retake affordance 879 and saveaffordance 818 for saving the captured media item 820-2.

Device 600 also updates camera options region 825 to replace captureaffordance 821 and camera selector affordance 827 with markup affordance877, edit affordance 878, and share affordance 880. Markup affordance877 allows a user to mark-up media item 820-2. Edit affordance 878allows a user to edit media item 820-2 such as by cropping the image oradjusting other characteristics of media item 820-2. Share affordance880 allows a user to send media item 820-2 to another device, such as,for example in a messaging application or email application.

Device 600 displays camera options region 825, including visual effectsoption affordances 824. Visual effects option affordances 824 can beselected to display their respective option menus, which can be used tomodify captured media item 820-2 (as well as recorded video media item820-4 discussed below). For example, FIGS. 8AN-8AP illustrate device 600adding a sticker to media item 820-2. Sticker effects affordance 824-2is selected in FIG. 8AN, which causes device 600 to display sickeroptions menu 856 in FIG. 8AO, which also shows selection of rabbitsticker 858-2, which is positioned on subject 832 in media item 820-2,shown in FIG. 8AP. Sticker options menu 856 is closed in FIG. 8AP,displaying camera options region 825 in FIG. 8AQ.

In FIG. 8AQ, device 600 also displays media item 820-2 with rabbitsticker 858-2 positioned on subject 832, along with other visual effects(e.g., avatar 835 and helmet sticker 858-1).

Stickers can be added to recorded media items that are in a video formatin a manner that is similar to that described above for media item 820-2(still image). For example, FIG. 8AR shows an embodiment in which videomedia item 820-4 is a recorded video. Media item 820-4 can be created ina manner similar to that described above for media item 620-4.

In FIG. 8AR, device 600 displays recorded video media item 820-4 inimage display region 820. Device 600 also displays camera options region825 having video scrubber 8120 for recorded video media item 820-4,effects affordance 822, edit affordance 878, markup affordance 877, andshare affordance 880. Camera options region 825 also includes visualeffects option affordances 824. Visual effects option affordances 824can be selected to display their respective option menus, which can beused to modify captured media item 820-4. For example, FIGS. 8AR-8AVillustrate device 600 adding stickers 858-2 and 858-3 to media item820-4.

In some embodiments, displayed stickers can have different modeledbehaviors in a video media item (or live video stream). For example,some stickers have an appearance of being applied to the display (e.g.,601) and remain static as objects in the image data move. An example ofsuch a sticker is demonstrated by heart sticker 858-3 in FIGS. 8AX-8BC.In some embodiments, stickers have this static behavior when the stickeris placed in the media item (e.g., 820-4) or live camera preview (e.g.,820-1) when device 600 does not detect the presence of all or a portion(e.g., head or face) of the representation of the subject in the mediaitem, live camera preview, or field of view of camera 602. For example,if a subject's face is completely off screen (e.g., not detected in thefield of view of camera 602), and a sticker is placed on image displayregion 820, then, once the face is detected in the field of view ofcamera 602 and is displayed on image display region 820 with thepreviously placed sticker, the sticker is stationary and does not trackany movement of the subject's face.

Other stickers have the appearance of being applied to the display andmoving to follow an object (e.g., an item in the field of view of thecamera including, for example, an avatar or a representation of thesubject) in the image. In some embodiments, the sticker is placed at alocation remote from the object the sticker follows. An example of sucha sticker is demonstrated by helmet sticker 858-1 in FIGS. 8AK and8AX-8BC. Helmet sticker 858-1 follows lateral movement of subject 832,but not rotational movement of the subject. Another example of such asticker is demonstrated by starfish sticker 858-5 in FIGS. 8BE-8BQ. Whenpositioned away from the subject's face, starfish sticker 858-5 followsmovement of the subject's face while maintaining a relative positionwith respect to the face (e.g., starfish sticker 858-5 follows movementof the subject's face forwards, backwards, and side-to-side, but notrotational movement of the subject's face).

Yet other stickers have the appearance of being applied to an object inthe field of view of camera 602 and move to follow the object within thefield of view (e.g., having an appearance of depth as the stickeradjusts with the object in the image). An example of such a sticker isdemonstrated by rabbit sticker 858-2 in FIGS. 8AX-8BC. Rabbit sticker858-2 moves with lateral and rotational movement of the subject(specifically, the subject's shoulder). Another example of such asticker is glasses sticker 858-4 in FIGS. 8BG-8BM. When positioned onthe subject's face in FIGS. 8BG-8BM, glasses sticker 858-4 followslateral movement (e.g., side to side, up/down) of the subject's face,movement of the subject's face forwards and backwards, and rotationalmovement of the subject's face. The glasses sticker 858-4 has theappearance of physically being applied (e.g., stuck) to the subject'sface.

In some embodiments, a sticker's behavior changes based on its positionrelative to objects in the media item (or live camera preview or fieldof view of the camera). In some embodiments, the behavior of a stickerchanges in response to detecting changes in the position of the stickerrelative to the object. Examples of such stickers are shown in FIGS.8BE-8BQ, described in greater detail below.

FIGS. 8AW-8BC show video playback and editing of media item 820-4, whichincludes avatar 835, helmet sticker 858-1, rabbit sticker 858-2, andheart sticker 858-3. As media item 820-4 advances during playback, heartsticker 858-3 remains static, having the appearance of being applied todisplay 601 and not moving. Helmet sticker 858-1 has the appearance ofbeing applied to the display, but moves (e.g., translates) in responseto changes in a position of avatar 835. Rabbit sticker 858-2 has theappearance of being applied on the shoulder of subject 832 in media item820-4. As the shoulder moves, rabbit sticker 858-2 changes positions,including orientation and displayed size, to give the appearance offollowing the subject's shoulder during playback of media item 820-4.

In some embodiments, a sticker or other virtual object that is appliedto a location of the representation of the field of view of one or morecameras that includes a respective object (e.g., a face, hand, or otherbody part of a user of the device) being tracked in three dimensions(e.g., via depth information from a depth sensor) is attached to therespective object such that the size and/or orientation of the virtualobject changes as the distance of the respective object from the one ormore cameras and/or the orientation of the respective object withrespect to the one or more cameras changes in addition to movinglaterally (e.g., side to side and/or up and down) as the respectiveobject moves laterally (e.g., side to side and/or up and down) in thefield of view of the one or more cameras. For example, as shown in FIGS.8AS-8AT, rabbit sticker 858-2 is placed on the representation of thesubject's shoulder. As the shoulder moves up and towards the camera inFIGS. 8AX-8AY, rabbit sticker 858-2 moves with the shoulder laterally(up) and enlarges as the shoulder moves towards the camera. In someembodiments, a sticker or other virtual object that is applied to alocation of the representation of the field of view of one or morecameras that is away (e.g., remote) from a respective object (e.g., aface, hand, or other body part of a user of the device) being tracked inthree dimensions (e.g., via depth information from a depth sensor) moveslaterally (e.g., side to side and/or up and down) as the respectiveobject moves laterally (e.g., side to side and/or up and down) in thefield of view of the one or more cameras without being attached to theobject such that the size and/or orientation of the virtual object doesnot change as the distance of the respective object from the one or morecameras and/or the orientation of the respective object with respect tothe one or more cameras changes. For example, as shown in FIGS. 8AX and8AY, helmet sticker 858-1 is placed at a location that is away from therepresentation of the subject. As the representation of the subjectmoves sideways and closer to the camera in FIGS. 8AX and 8AY, helmetsticker 858-1 moves laterally based on the movement of therepresentation of the user, but does not change in orientation or size.

In FIG. 8AY, device 600 detects input 882 (e.g., a swipe or draggesture) on scrubber 8120, and scrubs through media item 820-4 based onthe input. As shown in FIG. 8AZ, in response to input 882, device 600reverses through media item 820-4 to display the portion of media item820-4 that was previously displayed in FIG. 8AX. Device 600 detectsinput 884 on edit affordance 878.

In FIG. 8BA, device 600 enters an editing mode (for editing media item820-4) and displays selectable affordances 885 for cropping the video bydragging affordances 885 at the ends of the video scrubber 8120 (viainput 883) to change an ending time for the video scrubber.

As shown in FIG. 8BB, video scrubber 8120 is trimmed in response toinput 883. Device 600 detects input 886 on done affordance 887 and exitsthe editing mode.

In FIG. 8BC, the video scrubber 8120 is shown trimmed (e.g., having ashorter length than prior to entering edit mode in FIG. 8AZ). Device 600detects input 888 on effects affordance 822, which exits effects mode.

As demonstrated in FIG. 8BD, the visual effects displayed in media item820-4 are hidden from display in response to detecting input 888deselecting effects affordance 822.

In some embodiments, stickers can have a behavior that is determinedbased on conditions (e.g., position of the sticker relative to otherobjects, the presence (or absence) of objects when the sticker isplaced, etc.) of the sticker's placement. For example, in someembodiments, a sticker can have a first type of behavior when positionedremote from an object or region, and a second type of behavior whenpositioned on the object or region. In some embodiments, a sticker canhave a third type of behavior if an object is not present when thesticker is placed. In some embodiments, the behavior of the sticker canchange based on changes to the sticker's placement (e.g., relative to anobject).

For example, FIGS. 8BE-8BQ illustrate an example embodiment showing thebehavior of various stickers displayed in live camera preview 820-1, andchanges to the behavior of the stickers based on changes to thestickers' placement relative to the subject's face. In the embodimentillustrated in FIGS. 8BE-8BQ, the behavior of the stickers is determinedbased on the location of the stickers relative to the subject's face.

In FIG. 8BE, device 600 displays camera application user interface 815having an appearance similar to that shown in FIG. 8D, but havingstickers 858 displayed in image display region 820. Image display region820 shows live camera preview 820-1 from camera 602, showing arepresentation of subject 832 positioned in the field of view of camera602 and background 836 displayed behind subject 832. Glasses sticker858-4, starfish sticker 858-5, and baseball sticker 858-6 are eachdisplayed positioned away from (e.g., not on) the face of subject 832.The stickers shown in FIG. 8BE-8BQ can be placed and moved in the imagedisplay region in accordance with the various embodiments discussedherein.

When the stickers 858 are positioned away from (e.g., not on) thesubject's face, they have a first type of behavior. For example, thestickers follow movement of the subject's face laterally (e.g., side toside and up/down), forwards (e.g., towards camera 602), and backwards(e.g., away from camera 602), but not rotational movement of thesubject's face (e.g., not following the pitch and yaw of the subject'sface).

In FIG. 8BF, device 600 detects input 889 (e.g., a touch and draggesture on display 601) on glasses sticker 858-4. In response todetecting input 889, device 600 displays glasses sticker 858-4 movingwith the input (e.g., in the direction and magnitude of movement ofinput 889), as shown in FIGS. 8BG and 8BH.

In some embodiments, device 600 provides an indication when the stickermoves to a location that corresponds to the object the sticker isfollowing. For example, in FIG. 8BG, device 600 shows glasses sticker858-4 moving from a location away from the subject's face to a locationon the subject's face. Device 600 displays brackets 890 around thesubject's face and, optionally, produces haptic feedback 892 (e.g., atactile output, with or without an audio output) as glasses sticker858-4 moves onto the subject's face. In some embodiments, brackets 890provide a visual indication of a region in which glasses sticker 858-4can be placed so that they will be displayed on the subject's face(resulting in modified behavior for glasses sticker 858-4). Thus,brackets 890 (and haptic feedback 892) can indicate to a user when theyare dragging the sticker to a placement area (e.g., inside the brackets)that will position the sticker on the subject's face. In someembodiments, haptic feedback 892 indicates that the glasses sticker858-4 has moved from the region outside the subject's face (e.g., aregion in which the sticker has the first type of behavior) to theregion on the subject's face (e.g., the region in which the sticker hasthe second type of behavior).

In some embodiments, when device 600 detects that the position of thesticker (or other visual effect) moves to a location that corresponds tothe object the sticker is following, device 600 modifies the appearanceof the sticker based on the position of the object and modifies thebehavior of the sticker (in some embodiments, the behavior of thesticker is modified after detecting termination of input 889). Device600 also modifies the appearance and behavior of the sticker in anopposite manner when the sticker is moved from the location thatcorresponds to the object the sticker is following, to a location remotefrom the object.

In FIG. 8BH, when device 600 detects the position of glasses sticker858-4 on the subject's face (e.g., positioned within brackets 890),device 600 modifies the sticker based on the position (e.g., size andorientation) of the subject's face. For example, the glasses sticker858-4 increases in size (e.g., to proportionally match the size of thesubject's face), and rotates from its static position in FIG. BG, tomatch the position of the subject's face in FIG. 8BH. In someembodiments, the sticker is positioned on a plane that corresponds tothe subject's nose and transforms coordinates of its position as theface moves. In some embodiments, a face mesh (e.g., a depth mask) isused to determine the point of the nose. In some embodiments, thesticker is positioned on the plane, but does not conform to the shape ofthe subject's face.

Device 600 also changes the behavior of the sticker to a second type ofbehavior (e.g., a behavior different than the first type of behavior).As shown in FIG. 8BH, a second type of behavior can include a behaviorin which the sticker maintains its 3D position on the face, followingrotational movement of the subject's face (e.g., following the pitch andyaw of the face), in addition to lateral movement and movementforwards/backwards. The behavior of other stickers is not changed. Thus,baseball sticker 858-6 and starfish sticker 858-5 continue to have thefirst type of behavior because they are not placed on the subject'sface. Instead, starfish sticker 858-5 and baseball sticker 858-6continue to maintain a relative spacing from the face.

In some embodiments, brackets 890 persist with input 889 while input 889is positioned on the subject's face (e.g., within the brackets). Thus,when device 600 detects termination of input 889, device 600 ceasesdisplaying brackets 890, as shown in FIG. 8BI.

FIGS. 8BI and 8BJ illustrate device 600 rotating glasses sticker 858-4with the subject's face, while starfish sticker 858-5 and baseballsticker 858-6 remain stationary (unlike glasses sticker 858-4, they donot follow rotational movement of the subject's face).

In FIG. 8BK, the subject moves closer to camera 602. Glasses sticker858-4 moves with the forward movement of the subject's face andmaintains its rotated position with the subject's face. Similarly,baseball sticker 858-6 and starfish sticker 858-5 move with the forwardmovement of the subject's face, while maintaining a relative spacingwith respect to the subject's face. Baseball sticker 858-6 and starfishsticker 858-5 do not rotate with the subject's face.

In FIG. 8BK, device 600 detects input 893 on capture affordance 821 and,in response, captures an image of live camera preview 820-1.

In FIG. 8BL, device 600 displays camera application user interface 815showing the captured image (e.g., media item 820-5) in the image displayregion 820 (e.g., media item 820-5 replaces the live camera preview820-1 shown in image display region 820). Media item 820-5 is arepresentation of the live camera preview 820-1 at the time the captureaffordance 821 was selected. The embodiment shown in FIG. 8BL is similarto that shown in FIG. 8AM, but with media item 820-5 (instead of mediaitem 820-2), which includes glasses sticker 858-4, starfish sticker858-5, and baseball sticker 858-6 displayed relative to the face ofsubject 832 and background 836.

The display of visual effects, as discussed herein, is similar acrossdifferent embodiments. For example, unless specified otherwise, visualeffects can be displayed and manipulated in a similar manner in a cameraapplication, a messaging application, an avatar editing application, alive video messaging application, or any other application discussedherein. Additionally, visual effects can be displayed and manipulated ina similar manner across different types of image data. For example,unless specified otherwise, visual effects can be displayed andmanipulated in a similar manner in a live camera preview, a media item,streamed image data, or any other image data discussed herein. Forexample, FIGS. 8BL-8BQ illustrate displaying stickers in media item820-5 in a manner similar to that discussed above for live camerapreview 820-1. Specifically, FIGS. 8BL-8BQ demonstrate that stickers ina media item (e.g., 820-5) can be repositioned, and their respectivebehavior changed accordingly, in a manner similar to that discussedabove with respect to live image preview 820-1.

In FIG. 8BM, device 600 detects input 894 on glasses sticker 858-4 anddisplays brackets 890. Input 894 moves glasses sticker 858-4 off of thesubject's face in FIG. 8BN. As the glasses sticker is moved off theface, device 600 generates haptic feedback 892, ceases displayingbrackets 890, and modifies glasses sticker 858-4 back to its originalshape (e.g., the slightly tilted shape from when the glasses sticker waspositioned remote from the subject's face in FIG. 8BE). Device 600 alsomodifies the behavior of glasses sticker 858-4 back to the first type ofbehavior (the behavior associated with a sticker placed remote from anobject it tracks) and modifies the appearance of the sticker based onthe changed behavior. Thus, glasses sticker 858-4 is displayed having alarger size in FIGS. 8BN-8BQ (compared to the size of glasses sticker858-4 in FIGS. 8BE-8BG) to maintain the relative spacing of glassessticker 858-4 to the position of the subject's face in media item 820-5,similar to the size changes of starfish sticker 858-5 and baseballsticker 858-6 based on forward movement of the subject's face in FIGS.8BJ-8BK.

FIGS. 8BO-8BQ demonstrate placing a different sticker on the subject'sface in media item 820-5, and that different sticker being modifiedbased on a different behavior of the sticker in response to being movedonto the subject's face. For example, in FIG. 8BO, device 600 detectsinput 895 on baseball sticker 858-6. As baseball sticker 858-6 isdragged onto the subject's face in FIG. 8BP, device 600 displaysbrackets 890, generates haptic feedback 892 (e.g., a tactile output),and repositions baseball sticker 858-6 based on the angle of thesubject's face. In other words, the behavior of baseball sticker 858-6changes such that it follows rotational movement of the subject's face,similar to the way glasses sticker 858-4 followed rotational movement ofthe subject's face when it was placed on the subject's face.

In FIG. 8BQ, input 895 terminates and device 600 ceases displayingbrackets 890.

FIGS. 9A and 9B are a flow diagram illustrating a method for displayingvisual effects in a camera application using an electronic device inaccordance with some embodiments. Method 900 is performed at a device(e.g., 100, 300, 500, 600) with a camera and a display apparatus. Someoperations in method 900 are, optionally, combined, the orders of someoperations are, optionally, changed, and some operations are,optionally, omitted.

As described below, method 900 provides an intuitive way for displayingvisual effects in a camera application. The method reduces the cognitiveburden on a user for applying visual effects to an image viewed in acamera application, thereby creating a more efficient human-machineinterface. For battery-operated computing devices, enabling a user todisplay visual effects in an image faster and more efficiently conservespower and increases the time between battery charges.

The electronic device (e.g., 600) displays (902), via the displayapparatus (e.g., 601), a camera user interface (e.g., 815). The camerauser interface includes (904) a camera display region (e.g., 820)including a representation (e.g., 835) of image data captured via thecamera (e.g., 602).

In some embodiments, the image data includes (906) depth data (e.g.,image data that includes a depth aspect (e.g., depth data independent ofRGB data) of a captured image or video. In some embodiments, the imagedata includes at least two components: an RGB component that encodes thevisual characteristics of a captured image, and depth data that encodesinformation about the relative spacing relationship of elements withinthe captured image (e.g., the depth data encodes that a user is in theforeground, and background elements, such as a tree positioned behindthe user, are in the background. In some embodiments, the depth data isa depth map. In some embodiments, a depth map (e.g., depth map image)contains information (e.g., values) that relates to the distance ofobjects in a scene from a viewpoint (e.g., a camera). In one embodimentof a depth map, each depth pixel defines the position in the viewpoint'sz-axis where its corresponding two-dimensional pixel is located. In someexamples, a depth map is composed of pixels wherein each pixel isdefined by a value (e.g., 0-255). For example, the “0” value representspixels that are located at the most distant place in a “threedimensional” scene and the “255” value represents pixels that arelocated closest to a viewpoint (e.g., camera) in the “three dimensional”scene. In other examples, a depth map represents the distance between anobject in a scene and the plane of the viewpoint.) In some embodiments,the depth map includes information about the relative depth of variousfeatures of an object of interest in view of the depth camera (e.g., therelative depth of eyes, nose, mouth, ears of a user's face). In someembodiments, the depth map includes information that enables the deviceto determine contours of the object of interest in a z direction.

In some embodiments, the depth data has a first depth component (e.g., afirst portion of depth data that encodes a spatial position of thesubject in the camera display region; a plurality of depth pixels thatform a discrete portion of the depth map, such as a foreground or aspecific object) that includes the representation of the subject in thecamera display region (e.g., 820). In some embodiments, the depth datahas a second depth component (e.g., a second portion of depth data thatencodes a spatial position of the background in the camera displayregion; a plurality of depth pixels that form a discrete portion of thedepth map, such as a background), separate from the first depthcomponent, the second depth aspect including the representation of thebackground in the camera display region. In some embodiments, the firstdepth aspect and second depth aspect are used to determine a spatialrelationship between the subject in the camera display region and thebackground in the camera display region. This spatial relationship canbe used to distinguish the subject from the background. This distinctioncan be exploited to, for example, apply different visual effects (e.g.,visual effects having a depth component) to the subject and background).In some embodiments, all areas of the image data that do not correspondto the first depth component (e.g., areas of the image data that are outof range of the depth camera) are segmented out (e.g., excluded) fromthe depth map.

In some embodiments, the representation (e.g., 835) of image datacaptured via the camera (e.g., 602) is a live camera preview (e.g., astream of image data that represents what is in the field of view of thecamera).

In some embodiments, while the first camera display mode is active, theelectronic device (e.g., 600) detects a swipe gesture on the cameradisplay region (e.g., 820). In some embodiments, in response todetecting the swipe gesture on the camera display region (e.g., theelectronic device (e.g., 600) changes an appearance of the displayedrepresentation of the selected avatar option in the camera displayregion from a first appearance (e.g., an appearance based on thecurrently selected avatar option) to a second appearance (e.g., anappearance based on a different avatar option (e.g., a null avataroption or an avatar option corresponding to a different avatar,including avatars of different types (e.g., customizable,non-customizable))), where the second appearance corresponds to adifferent one of the plurality of avatar options (e.g., a differentavatar option included in the avatar selection region). Changing theappearance of the displayed representation of the selected avatar optionin response to detecting a swipe gesture on the camera display regionprovides the user with a quick and easy method to change arepresentation of a selected avatar. Providing additional controloptions without cluttering the UI with additional displayed controlsenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently

In some embodiments, when the different one of the plurality of avataroptions is a null avatar option, the device (e.g., 600) ceases todisplay the representation of the avatar on the representation of thesubject (e.g., the device foregoes replacing image data of the user'shead with a virtual avatar). In some embodiments, when the different oneof the plurality of avatar options is an avatar option of a differentavatar character (including a customizable or non-customizable avatarcharacter), the device replaces the selected avatar character with thedifferent avatar character (e.g., the device replaces the representationof the avatar with a representation of a different avatar). In someembodiments, replacing the selected avatar character with the differentavatar character includes displaying an animation of the differentavatar character moving to the center of the screen. In someembodiments, replacing the selected avatar character with the differentavatar character includes displaying an animation of the differentavatar character moving to the user's head. In some embodiments,replacing the selected avatar character with the different avatarcharacter includes blurring the background while the selected avatar isbeing replaced. Displaying an animation (e.g., the different avatarcharacter moving to the center of the screen, the different avatarcharacter moving to the user's head, blurring the background) once/whilethe selected avatar character is replaced with the different avatarcharacter provides visual feedback that the avatar character is beingchanged. Providing improved visual feedback to the user enhances theoperability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently. In some embodiments, the currently selected avatar optioncorresponds to an avatar of a first type (e.g., a customizable avatar),and the different avatar option corresponds to an avatar of a secondtype (e.g., a non-customizable avatar).

In some embodiments, changing the appearance of the displayedrepresentation of the selected avatar option in the camera displayregion (e.g., 820) from the first appearance to the second appearanceincludes moving a first version of the representation of the selectedavatar option, off of the display, the first version having the firstappearance. In some embodiments, changing the appearance of thedisplayed representation of the selected avatar option in the cameradisplay region from the first appearance to the second appearanceincludes moving a second version of the representation of the selectedavatar option to substantially the center of the display, the secondversion having the second appearance. Moving the first version of therepresentation of the selected avatar option off of the display andmoving the second version of the representation of the selected avataroption to substantially the center of the display provides visualfeedback that the first version is being replaced by the second version.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, changing the appearance of the displayedrepresentation of the selected avatar option from the first appearanceto the second appearance includes moving a first version of therepresentation of the selected avatar off of the display, the firstversion having the first appearance. In some embodiments, changing theappearance of the displayed representation of the selected avatar optionfrom the first appearance to the second appearance includes moving asecond version of the representation of the selected avatar option tosubstantially the position of the representation of the subjectdisplayed in the camera display region (e.g., 820), the second versionhaving the second appearance. Moving the first version of therepresentation of the selected avatar option off of the display andmoving the second version of the representation of the selected avataroption to substantially the position of the representation of thesubject displayed in the camera display region provides visual feedbackthat the first version is being replaced by the second version.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, changing the appearance of the displayedrepresentation of the selected avatar option from the first appearanceto the second appearance includes modifying the visual appearance of thebackground displayed in the camera display region (e.g., 820) (e.g.,blurring the background, desaturating the background).

The camera user interface also includes (908) a first affordance (e.g.,an affordance that corresponds to a virtual avatar) associated with afirst camera display mode (e.g., a mode in which image data of theuser's head is replaced with a virtual avatar).

In some embodiments, the camera user interface (e.g., 815) furtherincludes a sticker affordance (e.g., 824-2, an affordance thatcorresponds to a function for enabling the display of stickers)associated with a sticker display mode (e.g., a mode in which stickersare enabled to be applied to the image data). In some embodiments, whiledisplaying the image data (and optionally a representation of theselected avatar option) in the camera display region (e.g., 820), theelectronic device (e.g., 600) detects a gesture (e.g., FIG. 8AH)directed to the sticker affordance. In some embodiments, in response todetecting the gesture directed to the sticker affordance, the electronicdevice activates the sticker display mode, where activating the stickerdisplay mode includes displaying a sticker selection region (e.g., 856)including a plurality of sticker options (e.g., 858, stickers),detecting a selection of one of the plurality of sticker options (e.g.,858-2, a sticker) in the sticker selection region, and, in response todetecting the selection, displaying a representation (e.g., 858-2) ofthe selected sticker option on the image data in the camera displayregion. In some embodiments, the selected sticker option displayed onthe image data has the appearance of being placed on the display screen(e.g., 601) (e.g., similar to the appearance of a user placing aphysical sticker on the screen of the device), and does not move orotherwise interact with objects represented in the image data. In someembodiments, the selected sticker option displayed on the image data hasthe appearance of being placed on the display screen (e.g., similar tothe appearance of a user placing a physical sticker on the screen of thedevice), but does move based on movement of objects displayed in theimage data. For example, a sticker appears as being placed on thedisplay, and interacts with objects represented in the image data (e.g.,a person), however movement of the sticker is restricted to movementalong x- and y-axes. In other words, the sticker appears to be appliedto the screen of the device, but can be moved on the screen by objectsrepresented in the image data (e.g., a person in the image data appearsto touch the sticker or drag it across the screen). In some embodiments,the selected sticker option displayed on the image data has theappearance of being inserted into the image data as an interactiveobject that forms a portion of the image data. Providing the appearanceof being inserted into the image data as an interactive object thatforms a portion of the image data provides visual feedback that theselected sticker can act as an interactive object within the image.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In such embodiments, thesticker appears to move based on interactions with objects representedin the image data. For example, the sticker can appear as an objectsitting on a person's shoulders. As the person moves, the sticker moveswith the person's shoulders to maintain the appearance of sitting on theperson's shoulders. This includes movement along x- and y-axes, as wellas movement along a z-axis.

In some embodiments, while displaying the representation of the selectedsticker option (e.g., 858-1, 858-2, 858-3, 858-4, 858-5, 858-6) on theimage data in the camera display region (e.g., 820), the device (e.g.,600) detects lateral movement of the subject (e.g., 832) in the field ofview of the one or more cameras (e.g., 602). In response to detectingthe lateral movement of the subject in the field of view of the one ormore cameras, the device moves the representation of the selectedsticker option laterally in accordance with the movement of the subjectin the field of view of the one or more cameras (e.g., without regard toa relationship of the sticker to the subject) (e.g., see helmet sticker858-1 in FIGS. 8AV-8AY). Moving the representation of the selectedsticker option laterally in accordance with lateral movement of thesubject in the field of view of the one or more cameras provides visualfeedback that the selected sticker can act as an interactive objectwithin the image. Providing improved visual feedback to the userenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently.

In some embodiments, while displaying the representation of the selectedsticker option (e.g., 858-1, 858-2, 858-3, 858-4, 858-5, 858-6) on theimage data in the camera display region (e.g., 820), the device (e.g.,600) detects rotation of the subject (e.g., 832) in the field of view ofthe one or more cameras (e.g., 602) (e.g., rotation relative to an axisperpendicular to the display; e.g., the subject turning their head). Inresponse to detecting the rotation of the subject in the field of viewof the one or more cameras, the device performs one or more of thefollowing steps. In accordance with a determination that therepresentation of the selected sticker option has (e.g., was placedwith) a first relationship to the subject (e.g., the sticker wasinitially (or is currently) placed at a location on the display thatcorresponds to the subject; e.g., the sticker is placed on therepresentation of the subject's face or other designated area (e.g.,shown with brackets (e.g., 890))), the device rotates the representationof the selected sticker option in accordance with a magnitude anddirection of the rotation of the subject (e.g., the sticker rotates andturns to follow the pitch and yaw of the subject's face) (e.g., seeglasses sticker 858-4 in FIGS. 8BH-8BM). In accordance with adetermination that the representation of the selected sticker optiondoes not have (e.g., was not placed with) the first relationship to thesubject (e.g., the sticker was initially (or is currently) placed at alocation on the display that was away from the subject; e.g., thesticker was placed outside the representation of the subject's face orother designated area), the device forgoes rotating the representationof the selected sticker option in accordance with the magnitude anddirection of the rotation of the subject (e.g., see starfish sticker858-5 and baseball sticker 858-6 in FIGS. 8BE-8BN). Selectively rotatingthe representation of the selected sticker option based on whether therepresentation of the selected sticker option has the first relationshipto the subject provides visual feedback about the behavior of theselected sticker option and indicates that it can act as an interactiveobject within the image. Providing improved visual feedback to the userenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently.

In some embodiments, while displaying the representation of the selectedsticker option (e.g., 858-1, 858-2, 858-3, 858-4, 858-5, 858-6) on theimage data in the camera display region (e.g., 820), the device (e.g.,600) detects movement of the subject (e.g., 832) toward (or away from)the one or more cameras (e.g., 602). In response to detecting themovement of the subject toward (or away from) the one or more cameras,the device performs one or more of the following steps. In accordancewith a determination that the representation of the selected stickeroption has (e.g., was placed with) the first relationship to the subject(e.g., the sticker was initially (or is currently) placed at a locationon the display that corresponds to the subject; e.g., the sticker wasplaced when the representation of the subject's face was present (e.g.,detected) within the field of view of the camera), the device enlarges(or shrinks) the representation of the selected sticker option inaccordance with a magnitude of movement of the subject toward (or awayfrom) the one or more cameras. For example, the rabbit sticker 858-2enlarges as shown in FIGS. 8AX and 8AY in response to the representationof the subject's shoulder moving towards the camera. In another example,starfish sticker 858-5 and baseball sticker 858-6 enlarge as the subjectmoves towards the camera in FIGS. 8BJ-8BK. In accordance with adetermination that the representation of the selected sticker optiondoes not have (e.g., was not placed with) the first relationship to thesubject (e.g., the sticker was initially (or is currently) placed at alocation on the display that was away from the subject; e.g., thesticker was placed when the representation of the subject's face was notpresent (e.g., not detected) within the field of view of the camera),the device forgoes enlarging the representation of the selected stickeroption in accordance with the magnitude of movement of the subjecttoward (or away from) the one or more cameras (e.g., see helmet sticker858-1 and heart sticker 858-3 in FIGS. 8AV-8AY). Selectively enlarging(or shrinking) the representation of the selected sticker option basedon whether the representation of the selected sticker option has thefirst relationship to the subject provides visual feedback about thebehavior of the selected sticker option and indicates that it can act asan interactive object within the image. Providing improved visualfeedback to the user enhances the operability of the device and makesthe user-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

While a subject is positioned within a field of view of the camera(e.g., 602) and a representation of the subject and a background (e.g.,objects in the field of view of the camera other than the subject) aredisplayed in the camera display region (e.g., 820), the electronicdevice (e.g., 600) detects (910) a gesture directed to the firstaffordance. In some embodiments, the electronic device detects (e.g.,recognizes) that the subject is positioned in the field of view.

In some embodiments, the camera user interface (e.g., 815), whiledisplaying the capture affordance, further includes a camera displayregion (e.g., 820) including a representation of a live preview of afield of view of the camera (e.g., a stream of image data thatrepresents what is in the field of view of the camera). In someembodiments, while a subject is positioned within the field of view ofthe camera (e.g., the electronic device detects/recognizes that thesubject is positioned in the field of view) and a representation of thesubject and a background (e.g., objects in the field of view of thecamera other than the subject) are displayed in the camera displayregion, the electronic device (e.g., 600) displays a representation of aselected avatar on the representation of the subject in the cameradisplay region (e.g., a displayed head or face portion of the user isreplaced with (or overlaid by (e.g., opaquely, transparently,translucently)) a head of a virtual avatar that corresponds to theselected avatar). In some embodiments, while displaying therepresentation of the selected avatar on the representation of thesubject in the camera display region, the electronic device receives arequest to display an avatar selection region. In some embodiments, inresponse to receiving the request to display an avatar selection region,the electronic device ceases to display the capture affordance anddisplays (e.g., at a location in the camera user interface that waspreviously occupied by the capture affordance) an avatar selectionregion (e.g., avatar menu 828) having a plurality of avatar affordances.In some embodiments, in response to (or in conjunction with) the avatarselection region no longer being displayed, the capture affordance isdisplayed (e.g., re-displayed).

In some embodiments, the camera user interface (e.g., 815), whiledisplaying the capture affordance (e.g., 821), further includes a cameradisplay region (e.g., 820) including a representation of a live previewof a field of view of the camera (e.g., a stream of image data thatrepresents what is in the field of view of the camera). In someembodiments, while a subject is positioned within the field of view ofthe camera and a representation of the subject and a background (e.g.,objects in the field of view of the camera other than the subject) aredisplayed in the camera display region, the electronic device (e.g.,600) displays a representation of a selected avatar on therepresentation of the subject in the camera display region (e.g., adisplayed head or face portion of the user is replaced with (or overlaidby (e.g., opaquely, transparently, translucently)) a head of a virtualavatar that corresponds to the selected avatar). In some embodiments,the electronic device detects (e.g., recognizes) that the subject ispositioned in the field of view. In some embodiments, while displayingthe representation of the selected avatar on the representation of thesubject in the camera display region, the electronic device detects achange in pose (e.g., position and/or orientation) of the subject. Insome embodiments, the change in pose is detected when the user movestheir head or any facial features. In some embodiments, in response todetecting the change in pose of the subject, the electronic devicechanges an appearance of the displayed representation of the selectedavatar option based on the detected change in pose of the subject whilemaintaining display of the background (e.g., as described with respectto method 900 and FIGS. 9A-9B). Changing the appearance of the displayedrepresentation of the selected avatar option based on the detectedchange in pose of the subject while maintaining display of thebackground enables the user to quickly and easily recognize thatmovements of the avatar correspond to and/or are based on detectedmovements of the user. Providing additional control options enhances theoperability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently.

In response to detecting the gesture directed to the first affordance,the electronic device (e.g., 600) activates the first camera displaymode. Activating the first camera display mode includes displaying (914)an avatar selection region (e.g., 829) (e.g., including a selected oneof a plurality of avatar options (e.g., affordances that representdifferent virtual avatars that can be selected to appear over the user'shead in the camera display region (e.g., 820) of the camera userinterface (e.g., 815)).

In some embodiments, the avatar selection region (e.g., 829) furtherincludes an option for ceasing to display the representation of theselected avatar option on the representation of the subject in thecamera display. In some embodiments, the electronic device (e.g., 600)receives a user input corresponding to selection of the option forceasing to display the representation of the selected avatar option onthe representation of the subject in the camera display region (e.g.,820). In some embodiments, in response to receiving a user inputcorresponding to selection of the option for ceasing to display therepresentation of the selected avatar option on the representation ofthe subject in the camera display region, the electronic device ceasesto display the representation of the selected avatar option on therepresentation of the subject in the camera display region.

In some embodiments, the avatar selection region (e.g., 829) includes anull avatar option (e.g., 830-2). When the null avatar option isselected, no avatar is displayed on the representation of the subject inthe camera display region (e.g., 820) (e.g., the device forgoesreplacing image data of the user's head with a virtual avatar). In someembodiments, the avatar selection region includes a “cancel” affordance(e.g., an “x” icon located in the corner of the avatar selectionregion). When the cancel affordance is selected, the device ceases todisplay the avatar selection region and, optionally, ceases to displayany selected avatar on the representation of the subject (e.g., thedevice foregoes replacing image data of the user's head with a virtualavatar).

In some embodiments, activating the first camera display mode (e.g., anavatar display mode in which image data of the user's head is replacedwith a virtual avatar) further includes, prior to displaying therepresentation of the selected avatar option on the representation ofthe subject in the camera display region (e.g., 820), displaying (916)the representation of the subject in the camera display region withoutdisplaying a representation of the selected avatar option on therepresentation of the subject. In some embodiments, after entering theavatar display mode, the device initially displays the representation ofthe subject without an avatar (e.g., the device foregoes replacing imagedata of the user's head with a virtual avatar). In some embodiments, theavatar option that is initially selected when entering the avatardisplay mode corresponds to a null avatar option. When the null avataroption is selected, the device foregoes replacing image data of thesubject's head with a virtual avatar.

Activating the first camera display mode includes displaying (918) arepresentation of the selected avatar option on the representation ofthe subject in the camera display region (e.g., 820) (e.g., a displayedhead or face portion of the user is replaced with (or overlaid by (e.g.,opaquely, transparently, translucently)) a head of a virtual avatar thatcorresponds to the selected avatar option). Displaying therepresentation of the selected avatar option on the representation ofthe subject in the camera display region enables the user to quickly andeasily recognize that selected avatar option relates to representationof the subject. Providing improved visual feedback to the user enhancesthe operability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently. In some embodiments, displaying a representation of theselected avatar option on the representation of the subject includesusing depth information obtained using one or more depth cameras of theelectronic device (e.g., 600).

In some embodiments, activating the first camera display mode furtherincludes displaying the selected avatar option with a static appearance(e.g., the avatar appearance does not change based on detected changesin the user's face) in the avatar selection region (e.g., 829). In someembodiments, activating the first camera display mode further includesupdating the selected avatar option to have a dynamic appearance thatchanges based on the detected change in pose of the subject (e.g., theavatar changes to mirror the detected changes in the user's face). Insome embodiments, activating the first camera display mode furtherincludes displaying an animation of the selected avatar having thedynamic appearance moving from the avatar selection region to therepresentation of the subject (e.g., a representation of the user'sface) in the camera display region (e.g., 820). In some embodiments, theavatar continues to track changes in the user's face during the animatedmovement from the avatar selection region to the user's face in thecamera display region.

In some embodiments, updating the selected avatar option to have adynamic appearance that changes based on the detected change in pose ofthe subject includes initially displaying the avatar option having thedynamic appearance with an initial pose that corresponds (e.g., matches)a pose of the avatar option having the static appearance, prior tochanging the appearance of the avatar option based on the detectedchange in pose of the subject.

While the first camera display mode is active, the electronic device(e.g., 600) detects (920) a change in pose (e.g., position and/ororientation of the subject). In some embodiments, the change in pose isdetected when the user moves their head or any facial features.

In response to detecting the change in pose of the subject, theelectronic device (e.g., 600) changes (922) an appearance of thedisplayed representation of the selected avatar option based on thedetected change in pose of the subject while maintaining display of thebackground (e.g., 836) (e.g., the virtual avatar displayed on the useris responsive to detected changes in the user's head and face such thata change in the user's head or face effects a change in the displayedvirtual avatar while still displaying the background). Changing theappearance of the displayed representation of the selected avatar optionbased on the detected change in pose of the subject while maintainingdisplay of the background enables the user to quickly and easilyrecognize that movements of the avatar correspond to and/or are based ondetected movements of the user. Providing additional control optionsenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, the camera user interface includesone or more features/functions of the camera user interface describedwith respect to the embodiment shown in FIGS. 6A-6BQ and the FIGS.9A-9B. For example, the camera user interface (e.g., 815) can include aneffects mode affordance (e.g., 622).

In some embodiments, the electronic device (e.g., 600) detects ahorizontal swipe gesture on the avatar selection region (e.g., 829). Insome embodiments, in response to detecting the horizontal swipe gesture,the electronic device displays an avatar creation affordance associatedwith a function for adding a new avatar option to the plurality ofavatar options. Displaying the avatar creation affordance associatedwith a function for adding a new avatar option to the plurality ofavatar options in response to detecting a horizontal swipe gestureenables the user to quickly and easily access the avatar creationaffordance from the avatar selection region. Providing additionalcontrol options without cluttering the UI with additional displayedcontrols enhances the operability of the device and makes theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, ahorizontal swipe gesture on the avatar selection region scrolls thedisplayed avatar options to reveal an avatar creation affordance. Insome embodiments, the avatar creation affordance can be selected tocreate a new avatar. When the new avatar is created, a new avatar optionrepresenting the created avatar is added to the plurality of avataroptions (e.g., 830) in the avatar selection region.

In some embodiments, while the first camera display mode is active, theelectronic device (e.g., 600) detects a swipe gesture on the avatarselection region (e.g., 829). In some embodiments, in response todetecting the swipe gesture on the avatar selection region, theelectronic device changes an appearance of the displayed representationof the selected avatar option in the camera display region (e.g., 820)from a first appearance (e.g., an appearance based on the currentlyselected avatar option) to a second appearance (e.g., an appearancebased on a different avatar option (e.g., a null avatar option or anavatar option corresponding to a different avatar, including avatars ofdifferent types (e.g., customizable, non-customizable))), where thesecond appearance corresponds to a different one of the plurality ofavatar options (e.g., a different avatar option included in the avatarselection region). Changing the appearance of the displayedrepresentation of the selected avatar option in response to detecting aswipe gesture on the avatar selection region enables the user to quicklyand easily change the appearance of the selected avatar option.Providing additional control options without cluttering the UI withadditional displayed controls enhances the operability of the device andmakes the user-device interface more efficient (e.g., by helping theuser to provide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments,when the different one of the plurality of avatar options is a nullavatar option (e.g., 830-2), the device ceases to display therepresentation of the avatar on the representation of the subject (e.g.,the device forgoes replacing image data of the user's head with avirtual avatar). In some embodiments, when the different one of theplurality of avatar options is an avatar option of a different avatarcharacter (including a customizable or non-customizable avatarcharacter), the device replaces the selected avatar character with thedifferent avatar character (e.g., the device replaces the representationof the avatar with a representation of a different avatar). In someembodiments, replacing the selected avatar character with the differentavatar character includes displaying an animation of the differentavatar character moving to the center of the screen. In someembodiments, replacing the selected avatar character with the differentavatar character includes displaying an animation of the differentavatar character moving to the user's head. In some embodiments,replacing the selected avatar character with the different avatarcharacter includes blurring the background (e.g., 836) while theselected avatar is being replaced. In some embodiments, the currentlyselected avatar option corresponds to an avatar of a first type (e.g., acustomizable avatar), and the different avatar option corresponds to anavatar of a second type (e.g., a non-customizable avatar).

In some embodiments, while the first camera display mode is active, inresponse to a determination that the subject is no longer positioned inthe field of view of the camera (e.g., face tracking is lost), theelectronic device (e.g., 600) displays an animation of therepresentation of the selected avatar option moving to a center locationin the camera display region (e.g., 820). Displaying an animation of therepresentation of the selected avatar option moving to a center locationin the camera display region in response to a determination that thesubject is no longer positioned in the field of view of the cameraprovides visual feedback to the user that the user is no longer beingdetected by the camera. Providing improved visual feedback to the userenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, when the user is no longerdetected in the field of view of the camera, the avatar moves to thecenter of the camera display region. In some embodiments, the backgroundis blurred when the user is no longer detected in the field-of-view ofthe camera.

In some embodiments, while the first camera display mode is active, inresponse to a determination that the subject is no longer positioned inthe field of view of the camera (e.g., face tracking is lost), theelectronic device (e.g., 600) modifies the visual appearance of thebackground (e.g., 836) displayed in the camera display region (e.g.,820) (e.g., blurring the background, desaturating the background).

In some embodiments, while the first camera display mode is active andthe representation of the selected avatar option (e.g., a representationof a customizable avatar option selected from the avatar selectionregion) is displayed on the representation of the subject in the cameradisplay region (e.g., 820) (e.g., image data of the user's head isreplaced with the customizable avatar), the electronic device (e.g.,600) detects a touch gesture (e.g., a tap gesture) on the selectedavatar option in the avatar selection region (e.g., 829). In someembodiments, in response to detecting the touch gesture, the electronicdevice displays an avatar editing user interface (e.g., a user interfacefor editing one or more features of the selected avatar option (e.g., aselected customizable avatar) having a plurality of options (e.g., editaffordances that are selectable to modify various features of thecustomizable avatar) for editing the selected avatar option. Displayingan avatar editing user interface in response to detecting a touchgesture on the selected avatar option in the avatar selection regionenables a user to quickly and easily access the avatar editing userinterface to edit the avatar. Providing additional control optionswithout cluttering the UI with additional displayed controls enhancesthe operability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently.

In some embodiments, the camera user interface (e.g., 815) furtherincludes a second affordance (e.g., 824-2, an affordance thatcorresponds to a function for displaying stickers) associated with asecond camera display mode (e.g., a mode in which virtual effects (e.g.,stickers) are applied to the image data). In some embodiments, while thesubject is positioned within the field of view of the camera (e.g., 602)and the representation of the subject and the background (e.g., 836) aredisplayed in the camera display region (e.g., 820), the electronicdevice (e.g., 600) detects a gesture directed to the second affordance.In some embodiments, in response to detecting the gesture directed tothe second affordance, the electronic device activates the second cameradisplay mode, where activating the second camera display mode includesdisplaying a visual effects selection region including a plurality ofgraphical objects (e.g., stickers).

In some embodiments, while the second camera display mode is active, theelectronic device (e.g., 600) detects a selection of one of theplurality of graphical objects (e.g., a sticker) in the visual effectsselection region (e.g., 824). In some embodiments, in response todetecting the selection, the electronic device displays a representationof the selected graphical object in the camera display region (e.g.,820). In some embodiments, the selected sticker is displayed in thecamera display region during a live camera preview (e.g., 820-1). Insome embodiments, displaying the sticker in the live camera previewincludes immediately displaying the sticker at a default location (e.g.,the center of the screen) of the camera display region. In someembodiments, displaying the sticker in the live camera preview includesdisplaying an animation of the sticker moving from the visual effectsselection region to a location on the camera display region. In someembodiments, this animation is determined based on a drag gesture of theuser selection of the sticker (e.g., a gesture in which the user touchesthe sticker and drags it to a location on the camera display region).

In some embodiments, the representation of image data captured via thecamera (e.g., 602) is a media item (e.g., 820-2, a still image orrecorded video). In some embodiments, the camera user interface (e.g.,815) further includes a third affordance (e.g., an affordance thatcorresponds to a function for displaying stickers) associated with athird camera display mode (e.g., a mode in which virtual effects (e.g.,stickers) are applied to a photograph or recorded video). In someembodiments, the electronic device (e.g., 600) detects a gesturedirected to the third affordance. In some embodiments, in response todetecting the gesture directed to the third affordance, the electronicdevice activates the third camera display mode, where activating thethird camera display mode includes displaying a visual effects selectionregion including a plurality of graphical objects (e.g., stickers).

In some embodiments, while the third camera display mode is active, theelectronic device (e.g., 600) detects a selection of one of theplurality of graphical objects (e.g., a sticker) in the visual effectsselection region. In some embodiments, in response to detecting theselection, the electronic device displays a representation of theselected graphical object on the media item (e.g., 820-2) in the cameradisplay region (e.g., 820). In some embodiments, the selected sticker isdisplayed in the camera display region when viewing a photograph orrecorded video. In some embodiments, displaying the sticker on thephotograph or recorded video includes immediately displaying the stickerat a default location (e.g., the center of the screen) of the cameradisplay region. In some embodiments, displaying the sticker in thephotograph or recorded video includes displaying an animation of thesticker moving from the visual effects selection region to a location onthe camera display region. In some embodiments, this animation isdetermined based on a drag gesture of the user selection of the sticker(e.g., a gesture in which the user touches the sticker and drags it to alocation on the camera display region).

Note that details of the processes described above with respect tomethod 900 (e.g., FIGS. 9A and 9B) are also applicable in an analogousmanner to the methods described above and below. For example, method 700optionally includes one or more of the characteristics of the variousmethods described above with reference to method 900. For example,visual effects such as stickers and virtual avatars are displayed inimage data in a messaging application user interface. For anotherexample, method 1100 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 900. For example, visual effects such as stickers and virtualavatars are displayed in image data in a media user interface. Foranother example, method 1300 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 900. For example, visual effects such as stickers and virtualavatars are displayed in image data in a user interface for a live videocommunication session. For another example, method 1500 optionallyincludes one or more of the characteristics of the various methodsdescribed above with reference to method 900. For example, visualeffects such as stickers and virtual avatars are displayed in image datafor a camera user interface. For brevity, these details are not repeatedbelow.

FIGS. 10A-10AL illustrate exemplary user interfaces for displayingvisual effects in a media item viewing mode, in accordance with someembodiments. The user interfaces in these figures are used to illustratethe processes described below, including the processes in FIGS. 11A-11B.

In FIG. 10A, device 600 shows home screen 1000 and detects input 1001 onmedia viewer application affordance 1002.

In FIG. 10B, in response to detecting input 1001, device 600 launches amedia view application associated with media viewer applicationaffordance 1002 and displays media user interface 1005. Media userinterface 1005 includes a series of stored media items 1006 and a mediadisplay region 1008 for displaying a media item corresponding a selectedone of stored media items 1006. In FIG. 10B, media display region 1008shows media item 1010 corresponding to selected stored media item1006-1.

Media item 1010 is an image that does not include encoded depth data(e.g., it was captured by a camera that does not encode depth data intocaptured media items (e.g., a camera other than camera 602)). Thus,media item 1010 does not include depth data, which, as discussed herein,is used to enable certain visual effects in an image.

In FIG. 10C, device 600 detects input 1011 on edit affordance 1012. Inresponse, device 600 displays, in FIG. 10D, edit options display region1014. Edit option display region includes effects affordance 1016(similar to effects affordances 622, 822), media edit affordance 1018,and markup affordance 1020.

In FIG. 10E, device 600 detects input 1021 on effects affordance 1016,to enable a visual effects mode in which any visual effects associatedwith the displayed media item (e.g., media item 1010) are displayed. Inthe embodiment illustrated in FIG. 10E, media item 1010 does not includevisual effects; in particular, image 1010 does note include depth-basedvisual effects because media item 1010 does not include depth data.

In FIG. 10F, in response to detecting input 1021 activating effectsaffordance 1016, device 600 highlights effects affordance 1016 andexpands edit option display region 1014 to display effects optionaffordances 1024. In the embodiment illustrated in FIG. 10F, becausemedia item 1010 does not include depth data to enable visual effects,avatar effects affordance 1024-1 and sticker effects affordance 1024-2are shown as not selectable, whereas remaining visual effects optionaffordances 1024-3, 1024-4, and 1024-5 are selectable. In someembodiments, when the media item does not include depth data to enablevisual effects, avatar effects affordance 1024-1 and sticker effectsaffordance 1024-2 are not displayed.

Because media item 1010 does not include depth data to enabledepth-based visual effects, no depth-based visual effects are displayedin media item 1010 when effects affordance 1016 is selected. Thus, mediaitem 1010 remains unchanged in FIG. 10F.

In FIG. 10G, device 600 detects input 1025 on cancel affordance 1026,and returns to the user interface shown in FIG. 10H.

In FIG. 10H, device 600 detects input 1027 on stored media item 1006-2to select stored media item 1006-2.

In FIG. 10I, device 600 displays media item 1028 in media display region1008. Media item 1028 corresponds to selected stored media item 1006-2showing a subject 1032 and background 1036. Media item 1028 is an imagecaptured by a camera (e.g., camera 602) that encodes depth data intocaptured media items. As explained herein, the depth data encoded intomedia items enables display of visual effects, particularly visualeffects having a depth component.

In FIG. 10I, visual effects are not displayed in media item 1028,because effects mode is not enabled for media item 1028. However, device600 displays effects icon 1029, to indicate that media item 1028contains depth data, and is capable of displaying visual effects.

In FIG. 10J, device detects input 1030 on edit affordance 1012 anddisplays, in FIG. 10K, edit option display region 1014 showing effectsaffordance 1016 in a non-highlighted state to indicate that visualeffects are not enabled for display in media item 1028.

In FIG. 10L, device 600 detects input 1033 on effects affordance 1016 toenable visual effects mode. When visual effects mode is enabled, asshown in FIG. 10M, device 600 highlights effects affordance 1016 andexpands edit option display region 1014 to display effects optionaffordances 1024. In the embodiment illustrated in FIG. 10M, becausemedia item 1028 includes depth data to enable depth-based visualeffects, avatar effects affordance 1024-1 and sticker effects affordance1024-2 are shown as selectable, and visual effects are displayed inmedia item 1028. In the embodiment illustrated in FIG. 10M, thedisplayed visual effects include customizable avatar 1037, shadow 1038displayed below avatar 1037 on the subject's neck, helmet sticker 1039,and rabbit sticker 1040.

In some embodiments, device 600 modifies avatar 1037 based on detectedchanges in a user's face positioned in the field-of-view of camera 602,which is encoded in the depth data of media item 1028. Thus, althoughmedia item 1028 is described in this embodiment as a still image, itshould be appreciated that media item 1028 is not limited to a stillimage and may include other media items such as a recorded video,including a recorded video having depth data. Similarly, device 600 canmodify the position of stickers applied to media item 1028 based ondetected changes in the position of objects in the media item, which isencoded in the depth data.

Visual effects, including depth-based visual effects, can be applied tomedia item 1028 and edited in accordance with the embodiments discussedherein. For example, avatar effects affordance 1024-1 can be selected toremove, modify, and/or switch the selected avatar (e.g., avatar 1037) inaccordance with the various embodiments disclosed herein. Additionally,sticker effects affordance 1024-2 can be selected to remove, modify,and/or add stickers to media item 1028 in accordance with the variousembodiments disclosed herein.

FIGS. 10N-10P illustrate device 600 adding heart sticker 1042 to mediaitem 1028. These processes are discussed in greater detail above withrespect to FIGS. 6U-6AD, 8AH-8AK, and 8AR-8AY. For the sake of brevity,details of these processes are not repeated here.

In FIG. 10Q, device 600 detects input 1043 on markup affordance 1020. InFIG. 10R, device 600 replaces the displayed edit option display region1014 with markup option menu 1044 including various selectable markupoptions 1045 for adding markup effects to media item 1028. FIGS. 10S-10Tshow a process for adding text 1046 to media item 1028 by selecting textaffordance 1045-1.

In FIG. 10U, device 600 detects input 1047 on edit affordance 1012. InFIG. 10V, device 600 replaces the displayed edit option display region1014 with image edit menu 1048 including various image edit affordances1050. FIGS. 10V-10Y show a process for editing media item 1028 to applya vivid color filter by selecting filter affordance 1050-1, and thenselecting vivid filter option 1052. The vivid color filter applieschanges to both the background 1036 in media item 1028 as well as theapplied visual effects (e.g., the hair on avatar 1037). In someembodiments, the filter changes an appearance of the avatar and therepresentation of the field of view of the one or more cameras in asimilar manner to increase the similarity between the avatar and therepresentation of the field of view of the one or more cameras (e.g.,applying a comic book filter, a sketch drawing filter, a black and whitefilter, a greyscale filter, or the like). In some embodiments, theavatar has a cartoon-like appearance that does not match with the realworld that is in the field of view of the one or more cameras, byapplying a filter that changes both the appearance of the avatar and therepresentation of the field of view of the one or more cameras, theappearance of the avatar is unified with the rest of the representationof the field of view of the one or more cameras. In some embodiments,the filter is a filter that reduces a realism of the representation ofthe field of view of the one or more cameras (e.g., a sketch filter or acomic book filter). In some embodiments the filter is a filter thatreduces a 3D effect (e.g., flattens) both the appearance of the avatarand the appearance of the representation of the field of view of the oneor more cameras.

FIG. 10Z shows media item 1028 edited to display new heart sticker 1042,text 1046, and having vivid color filter applied.

FIGS. 10AA-10AD show device 600 switching from avatar 1037 to avatar1054. These processes are discussed in greater detail above with respectto FIGS. 6G-6Q, 6BD-6BE, 6BK-6BN, and 8F-8AG. For the sake of brevity,details of these processes are not repeated here.

In FIG. 10AE, device 600 displays media item 1028 having stickers (e.g.,1039, 1040, 1042), avatar 1054, text 1046, and vivid color filterapplied. The color filter in media item 1028 affects the hair of newavatar 1054 when new avatar 1054 is displayed in media item 1028.

In FIG. 10AE, device 600 detects swipe gesture 1055 on effects optionaffordances 1024 and, in response, scrolls effects option affordances1024 to display screen effects affordance 1024-3 in FIG. LOAF.

In FIG. 10AG, device 600 detects input 1057 on screen effects affordance1024-3 and, in response, displays screen effects menu 1060. Screeneffects menu 1060 includes various screen effects options 1062, whichare selectable to apply a full-screen visual effect to media item 1028.In FIG. 10AH, null effects option 1062-1 is selected (e.g., 1063) anddevice 600 displays no screen effects in media item 1028.

In FIG. 10AI, device 600 detects input 1064 on balloon effects option1062-2. In FIGS. 10AJ-AL, device 600 displays an animated effect ofballoons 1065 in media item 1028. In some embodiments, an animatedeffect is applied in a manner that emphasizes the depth data encoded inmedia item 1028. For example, in FIG. 10AJ-10AL, some of the balloonsare displayed appearing in front of subject 1062 and visual effects(e.g., rabbit sticker 1040, avatar 1054, text 1046). Some of theballoons are displayed appearing at a depth behind subject 1062 andvisual effects (e.g., rabbit sticker 1040, avatar 1054, heart sticker1042).

In some embodiments, the screen effects can interact with visual effectsand objects in media item 1028 based on the depth of the objects andvisual effects in media item 1028. For example, a confetti screen effectcan show confetti falling in front of, and behind, objects in media item1028 (e.g., subject 1062) and visual effects (stickers and an avatar),and also falling on top of these objects and visual effects. Forexample, the confetti can be displayed falling on the avatar and fallingoff the side of the avatar based on a physics model of the fallingconfetti.

FIGS. 11A and 11B are a flow diagram illustrating a method fordisplaying visual effects in a media item viewing mode using anelectronic device in accordance with some embodiments. Method 1100 isperformed at a device (e.g., 100, 300, 500, 600) with a displayapparatus. Some operations in method 1100 are, optionally, combined, theorders of some operations are, optionally, changed, and some operationsare, optionally, omitted.

As described below, method 1100 provides an intuitive way for displayingvisual effects in a media item viewing mode. The method reduces thecognitive burden on a user for displaying visual effects in an image orvideo, thereby creating a more efficient human-machine interface. Forbattery-operated computing devices, enabling a user to display visualeffects faster and more efficiently conserves power and increases thetime between battery charges.

The electronic device (e.g., 600) displays (1102), via the displayapparatus (e.g., 601), a media user interface (e.g., 1005). The mediauser interface includes (1104) a media display region (e.g., 1008)including a representation (e.g., 1010) of a media item (e.g., a stillimage or video). In some embodiments, the depth data corresponding tothe media item is obtained by a camera of the electronic device afterdetecting a prior selection of the effects affordance.

In some embodiments, the media item is a recorded image or video, andthe effects are applied based on the depth data after the media item isrecorded. In some embodiments, visual effects such as stickers, virtualavatars, and full screen effects can be added to image data, or changedto a different visual effect (e.g., replacing a sticker with a virtualavatar), after the image data is captured (e.g., recorded).

The media user interface (e.g., 1005) includes (1106) an effectsaffordance (e.g., 1016, an affordance associated with a function foractivating an image display mode (e.g., a mode in which depth data isdisplayed when the image data contains depth data)).

The electronic device (e.g., 600) detects (1108) a gesture (e.g., 1021)directed to the effects affordance (e.g., 1016). In some embodiments,the respective effects option (e.g., 1024-1) corresponds (1110) to aneffect for displaying an avatar in (e.g., overlaid on) the media item.In some embodiments, when displaying an avatar in the media item, imagedata of a person's head is replaced with a virtual avatar. Displayingthe avatar in the media item, where image data of a person's head isreplaced with a virtual avatar, provides visual feedback that the avatarrelates to and/or is associated with the person being replaced.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, theavatar is customizable. In some embodiments, the avatar isnon-customizable.

In some embodiments, the respective effects option (e.g., 1024-3)corresponds (1112) to an effect for displaying a plurality of virtualobjects (e.g., 1062, confetti, balloons, etc.) moving in (e.g.,animatedly overlaid on) the media item. In some embodiments, atrajectory of the plurality of objects moving in the media item aremodified based on a presence of at least one of an object in (e.g.,represented in; identified in) the media item (e.g., an object that isencoded in the original media item, not an object that is the product ofan effect applied to the media item, such as a person in the originalimage or video, but not a virtual avatar) or a visual effect (e.g., anavatar) applied to the media item. In some embodiments, objects such asconfetti or balloons are displayed in front of, behind, and/or on a userin the image. In some embodiments, the image includes another effect,such as an avatar, and objects such as confetti or balloons aredisplayed in front of, behind, and/or landing on the avatar.

In some embodiments, the respective effects option (e.g., 1024-2)corresponds (1114) to an effect for displaying one or more selectablegraphical icons (e.g., 1042, stickers) in (e.g., overlaid on) the mediaitem.

In response to detecting the gesture directed to the effects affordance,the electronic device (e.g., 600) displays (1116) a plurality of effectsoptions (e.g., stickers affordance, avatar affordance) for applyingeffects to the media item concurrently with a representation of themedia item, including, in accordance with a determination (1118) thatthe media item is associated with corresponding depth data (e.g., asdescribed herein), the plurality of effects options include a respectiveeffects option (e.g., 1024) for applying effects (e.g., stickers,virtual avatars, full screen effects, etc.) based on the depth data. Insome embodiments, in response to detecting the gesture, the electronicdevice activates an image display mode (e.g., a depth-data-based imagedisplay mode.

In some embodiments, a sticker affordance (e.g., 1024-2) is selectableto display a plurality of sticker options (e.g., 1042) that can bedisplayed on the media item (e.g., still image or video) based on depthdata. For example, a sticker can be placed on the media item andmodified based on depth data associated with the media item. In someembodiments, a sticker is associated with a relative position of anobject in a video. Movement of the object in the video has a depthcomponent that is used to modify a displayed aspect (e.g., size,orientation, position, etc.) of the sticker based on the movement of theobject. For example, the sticker is displayed on the object, and as theobject moves away from the camera (e.g., 602) (e.g., backwards), thesticker gets smaller to give the appearance the sticker is moving awayfrom the camera with the object. In some embodiments, an avataraffordance (e.g., 1024-1) is selectable to display a plurality of avataroptions (e.g., FIG. 10AB) that can be displayed on the media item basedon depth data. For example, an avatar can be displayed on a face in animage or video and modified based on depth data associated with the facein the image or video. For example, as the face moves forwards,backwards, side-to-side, or in any other manner that affects a depthcomponent of the image data, the avatar is displayed and modified basedon the depth data. Thus, as the face moves, the avatar is displayedmoving in the same manner.

In response to detecting the gesture directed to the effects affordance,the electronic device (e.g., 600) displays (1116) a plurality of effectsoptions (e.g., 1024, stickers affordance, avatar affordance) forapplying effects to the media item concurrently with a representation ofthe media item, including, in accordance with a determination (1120)that the image data does not include the depth data, the respectiveeffects option is not available for activation in the plurality ofeffects options (e.g., the respective effects option is excluded fromthe displayed plurality of effects options or is disabled in thedisplayed plurality of effects options). The respective effects optionnot being available for activation in the plurality of effects optionsin accordance with a determination that the image data does not includethe depth data provides feedback that the image data does not includedneeded depth data. Performing an operation when a set of conditions hasbeen met without requiring further user input enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments,affordances that correspond to depth data (e.g., the effects affordance)are not displayed or are not selectable when the image data does notinclude depth data.

In some embodiments, the plurality of effects options (e.g., 1024)includes an option (e.g., 1020) for adding labels to (e.g., overlaid on)the media item. In some embodiments, text labels can be added to theimage or video. In some embodiments, the plurality of effects optionsincludes an option for applying one or more image filters to (e.g.,overlaid on) the media item.

Note that details of the processes described above with respect tomethod 1100 (e.g., FIGS. 11A-11B) are also applicable in an analogousmanner to the methods described above and below. For example, method 700optionally includes one or more of the characteristics of the variousmethods described above with reference to method 1100. For example,visual effects such as stickers and virtual avatars are displayed inimage data in a messaging application user interface. For anotherexample, method 900 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 1100. For example, visual effects such as stickers and virtualavatars are displayed in image data in a user interface for a cameraapplication user interface. For another example, method 1300 optionallyincludes one or more of the characteristics of the various methodsdescribed above with reference to method 1100. For example, visualeffects such as stickers and virtual avatars are displayed in image datain a user interface for a live video communication session. For anotherexample, method 1500 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 1100. For example, visual effects such as stickers and virtualavatars are displayed in image data for a camera user interface. Forbrevity, these details are not repeated below.

FIGS. 12A-12AP illustrate exemplary user interfaces for displayingvisual effects in a live video communication session, in accordance withsome embodiments. The user interfaces in these figures are used toillustrate the processes described below, including the processes inFIGS. 13A-13B.

In FIG. 12A, device 600 shows a user interface 1200 for a live videocommunication session (e.g., a streamed video communication) between twoor more participants. In user interface 1200, device displaysparticipant image data 1204 representing a participant in the live videocommunication session. In window 1202, device 600 displays device imagedata 1201, which includes data obtained by a camera of device 600 (e.g.,camera 602) and representative of video data being transmitted to otherparticipants in the live video communication session. In FIG. 12A,device image data 1201 represents a user who is also a participant inthe video communication session (e.g., using an external electronicdevice (e.g., a device similar to device 600)). In FIG. 12A, deviceimage data 1201 is captured using camera 602 of device 600. In someembodiments, however, device image data 1201 can be captured using arear-facing camera of device 600.

In FIG. 12B, device 600 detects input 1205 on user interface 1200 and,in response, displays, in FIG. 12C, options display region 1206 havingeffects affordance 1208, camera selector affordance 1210, and endaffordance 1212. End affordance 1212 is associated with a function forterminating the live video communication session, and camera selectoraffordance 1210 is associated with a function for switching betweencameras (e.g., a rear-facing camera and camera 602). Effects affordance1208 is similar to effects affordance 622, 822, and 1016, and isselectable for enabling and disabling a mode (visual effect mode,effects mode) of device 600 in which device 600 is enabled or disabledfor displaying visual effects in user interface 1200.

In FIG. 12D, device 600 detects input 1213 on effects affordance 1208.

In FIG. 12E, in response to detecting input 1213, device 600 enablesvisual effects mode. Device 600 switches the displayed locations ofdevice image data 1201 and participant image data 1204 by displayingparticipant image data 1204 in window 1202, which is optionally moved toanother location in user interface 1200. Device 600 also highlightseffects affordance 1208 and expands options display region 1206 toinclude visual effects option affordances 1214. Visual effects optionaffordances 1214 are similar to visual effects option affordances 624,824, and 1024 discussed above, and are displayed when effects affordance1208 is enabled.

In FIG. 12F, device 600 detects input 1215 on avatar effects affordance1214-1. In FIG. 12G, in response, device 600 displays avatar optionsmenu 1216, which includes a scrollable listing of selectable avataroptions 1218. Avatar options menu 1216 is similar to avatar option menus628 and 828, and avatar options 1218 are similar to avatar options 630and 830.

Avatar options 1218 may be selected to apply a corresponding avatar tothe subject's face in device image data 1201 in a manner similar to thatdescribed above with respect to FIGS. 6G-6Q, 6BD-6BE, 6BK-6BN, and8F-8AG. In some embodiments, avatar options 1218 may be selected toapply a corresponding avatar in a similar manner to the participantdisplayed in participant image data 1204.

In FIG. 12G, because null avatar option 1218-1 is selected (e.g.,positioned within selection region 1219), no avatar is displayed on theuser's face in device image data 1201.

FIGS. 12H-12P illustrate a process for selecting, and switching between,various avatar options to display corresponding avatars on the user'sface in device image data 1201. For example, device 600 receives aselection of robot avatar option 1218-2 and displays robot avatar 1222on the user's face in device image data 1201, while maintaining displayof other objects (e.g., the user's body and background 1226) in deviceimage data 1201. Device 600 modifies the appearance of robot avatar 1222based on detected changes in the user's face using camera 602. Device600 then receives a selection of alien avatar option 1218-3 andtransitions the displayed avatar to alien avatar 1223. Device receives asubsequent selection of robot avatar option 1218-2 and again transitionsthe displayed avatar to robot avatar 1222. In some embodiments, a usercan select to remove (or forego displaying) an avatar in device imagedata 1201 by selecting null avatar option 1218-0. These processes arediscussed in greater detail above with respect to FIGS. 6G-6Q, 6BD-6BE,6BK-6BN, and 8F-8AG. For the sake of brevity, details of these processare not repeated here.

After detecting selection (via input 1221) of close icon 1220 in FIG.12P, device 600 closes avatar options menu 1216 and again displaysoptions display region 1206 in FIG. 12Q. Device 600 displays robotavatar 1222 on the user's head in device image data 1201 and modifiesthe avatar based on detected changes in the user's face using camera602.

In FIG. 12R, device 600 detects selection 1227 of sticker effectsaffordance 1214-2, and displays, in FIG. 12S, sticker options menu 1228having stickers 1230. Sticker options menu 1228 is similar to stickeroptions menu 656 and 856, and stickers 1230 are similar to stickers 658and 858.

Stickers 1230 may be selected to apply a corresponding sticker to deviceimage data 1201 in a manner similar to that described above with respectto FIGS. 6U-6AD, 8AH-8AK, and 8AR-8AY. In some embodiments, stickers1230 may be applied to participant image data 1204.

FIGS. 12T-12AA illustrate a process for applying and modifying stickersin device image data 1201. Device 600 detects selection of glassessticker 1230-1, and displays glasses sticker 1230-1 in device image data1201. Device 600 also detects subsequent gestures to move, resize, andposition glasses sticker 1230-1 on robot avatar 1222. These processesare discussed in greater detail above with respect to FIGS. 6U-6AD,8AH-8AK, and 8AR-8AY. For the sake of brevity, details of these processare not repeated here.

In some embodiments, selected stickers 1230 are not visible to otherparticipants in the live communication session until the user places thesticker in device image data 1201. In some embodiments, modifications toplaced stickers 1230 are not visible to other participants until themodification is complete. In some embodiments, once a selected stickerappears over device image data 1201 in user interface 1200, the stickeris visible to participants in the video communication session, even ifthe user has not yet placed the sticker 1230. Similarly, modificationsto placed stickers are visible such that continued adjustments of thesticker are visible to other participants in the live videocommunication session, even if the user is still modifying placement ofthe sticker.

In FIG. 12AB, robot avatar 1222 and glasses sticker 1230-1 are appliedto device image data 1201 and options display region 1206 is displayed.Device 600 detects gesture 1232 on user interface 1200 and, in response,switches the displayed locations of device image data 1201 andparticipant image data 1204, in FIG. 12AC, such that device image data1201 is displayed in window 1202, which is optionally moved to anotherlocation in user interface 1200. The visual effects (e.g., robot avatar1222 and glasses sticker 1230-1) are displayed in device image data 1201positioned in window 1202.

Device image data 1201 can be enlarged again (by again switchingpositions with participant image data 1204) in response to receivinggesture 1233 on window 1202, as shown in FIG. 12AD. Device 600 showsdevice image data 1201 and participant image data 1204 switched in FIG.12AE, with the displayed visual effects and options display region 1206.Because effects mode is enabled, when options display region 1206 isdisplayed, visual effects option affordances 1214 are also displayed.

The foregoing description for displaying visual effects in a live videocommunication session also applies to a live video communication sessionhaving three or more participants. FIGS. 12AF-12AP illustrate variousmethods for applying visual effects in such an embodiment.

In FIG. 12AF, device 600 displays user interface 1200 having deviceimage data 1201 in user window 1251 and respective participant imagedata 1204-1 to 1204-4 in respective participant windows 1255-1 to1255-4.

In FIG. 12AG, device 600 receives input 1256 on user interface 1200 and,in response, in FIG. 12AH, displays options display region 1206 havingeffects affordance 1208, camera selector affordance 1210, and endaffordance 1212. Because effects mode is not enabled when device 600receives input 1256, options display region 1206 is displayed withoutvisual effects option affordances 1214.

In FIG. 12AI, device 600 detects input 1257 on effects affordance 1208,which enables visual effects mode. Device 600 then displays user window1251 expanding to an enlarged view, as shown in FIG. 12AJ, while alsoapplying a blurred effect 1258 behind window 1251, and expanding optionsdisplay region 1206 to display visual effects option affordances 1214.Expanded window 1251 includes return icon 1260, which is selectable toreturn to the view shown in FIG. 12AI.

Visual effects can be applied to device image data 1201 using visualeffects option affordances 1214, as explained above. For example, device600 can apply stickers in a manner consistent with that described abovewith respect to FIGS. 12R-12AB. FIGS. 12AK-12AN illustrate a user addingcustomizable avatar 1262 to the user's face in device image data 1251.For example, avatar effects affordance 1214-1 is selected (1264) in FIG.12AK, which causes device 600 to display avatar options menu 1216 havingavatar options 1218, including null avatar option 1218-1 (selectable toforego displaying an avatar or removing a displayed avatar),non-customizable robot avatar option 1218-2, customizable avatar option1218-3, and new avatar affordance 1266 (selectable to create a newavatar in a manner similar to new avatar affordance 813). Customizableavatar option 1218-3 can be customized in a manner similar tocustomizable avatar option 830-5 discussed above.

In FIGS. 12AM-12AN customizable avatar option 1218-3 is selected, anddevice 600 displays customizable avatar 1273 on the user's face indevice image data 1201 displayed in window 1251. Device 600 modifiescustomizable avatar 1273 based on detected changes in the user's facepositioned in a field-of-view of camera 602. Device indicates when theuser's face is not detected within the field-of-view of camera 602, asexplained with respect to previous embodiments discussed herein.

In FIG. 12AN, device 600 detects input 1274 on return icon 1260 and, inresponse, displays device image data 1201 in window 1251 as shown inFIG. 12AO, including the applied visual effects (e.g., customizableavatar 1273). In the embodiment illustrated in FIG. 12AO, device imagedata 1201 and applied visual effects are visible to other participantsin the live video communication session. Because visual effects areenabled, options display region 1206 is displayed with visual effectsoption affordances 1214 and effects affordance 1208 highlighted.

In FIG. 12AO, device detects input 1275 on user interface 1200 and, inresponse, hides options display region 1206, and shifts and resizes theposition of various participant windows 1255, as shown in FIG. 12AP. Insome embodiments, device 600 enlarges a respective participant windowwhen the participant is speaking. For example, in FIG. 12AP, device 600detects audio 1276 (e.g., laughter) from the woman represented inparticipant image data 1204-2 and, in response, enlarges the woman'sparticipant window 1255-2.

FIGS. 13A-13B are a flow diagram illustrating a method for displayingvisual effects in a live video communication session using an electronicdevice in accordance with some embodiments. Method 1300 is performed ata device (e.g., 100, 300, 500, 600) with a display apparatus. Someoperations in method 1300 are, optionally, combined, the orders of someoperations are, optionally, changed, and some operations are,optionally, omitted.

As described below, method 1300 provides an intuitive way for displayingvisual effects in a live video communication session. The method reducesthe cognitive burden on a user for displaying visual effects, therebycreating a more efficient human-machine interface. For battery-operatedcomputing devices, enabling a user to display visual effects faster andmore efficiently conserves power and increases the time between batterycharges.

The electronic device (e.g., 600) displays (1302), via the displayapparatus (e.g., 601), a live video communication user interface (e.g.,1200) of a live video communication application. The live videocommunication user interface includes (1304) a representation (e.g.,1201) of a subject participating in a live video communication session.Including a representation of a subject participating in a live videocommunication session enables the user to quickly and easily recognizethe other participant(s) of the live video communication session.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, therepresentation of the subject participating in the live videocommunication session includes (1306) image data captured by a camera(e.g., 602) associated with the electronic device. In some embodiments,the subject is a user of the electronic device. In some embodiments, therepresentation of the subject participating in the live videocommunication session includes image data transmitted to the electronicdevice from a second electronic device. In some embodiments, the secondelectronic device is a device of another user, and the subject is theother user.

In some embodiments, the live video communication user interface (e.g.,1200) further includes a representation (e.g., 1204) of a secondparticipant in the live video communication session and a representationof a third participant in the live video communication session. In someembodiments, displayed sizes of the representations of the second andthird participants in the live video communication session are adjustedso all representations of the participants can fit on the screen.Adjusting the sizes of the representations of the second and thirdparticipants to fit on the screen allows the user to simultaneously viewtheir reactions to the visual effects applied to the representation ofthe user, thereby enhancing the operability of the device and making theuser-device interface more efficient (e.g., by allowing the user toeasily view the reactions of other participants without manual inputs)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently.

The live video communication user interface (e.g., 1200) includes (1308)a first affordance (e.g., 1208, an effects affordance) (e.g., anaffordance associated with a function for activating a camera effectsmode (e.g., a mode in which various camera effects can be applied to therepresentation of a user in a live video communication session)).

In some embodiments, prior to displaying the first affordance (e.g.,1208), the electronic device (e.g., 600) detects a first input (e.g.,1205) on the live video communication user interface (e.g., 1200) (e.g.,a tap gesture on the live video communication user interface to displayvideo call options), the first input corresponding to a request todisplay one or more options (e.g., an option to end the call, an optionto switch a camera view, etc.) associated with the live videocommunication session. In some embodiments, in response to detecting thefirst input, the electronic device displays the one or more options(e.g., 1208, 1210, 1212) associated with the live video communicationsession. In some embodiments, the electronic device displays the firstaffordance (e.g., 1208).

The electronic device (e.g., 600) detects (1310) a gesture (e.g., 1213)directed to the first affordance (e.g., 1208). In response to detecting(1312) the gesture directed to the first affordance, the electronicdevice activates (1314) a camera effects mode.

In some embodiments, in response to detecting the gesture directed tothe first affordance (e.g., 1208), the electronic device (e.g., 600)displays a first visual-effect affordance associated with a first typeof visual effect and a second visual-effect affordance associated with asecond type of visual effect that is different from the first type ofvisual effect and, optionally, a third visual-effect affordanceassociated with a third type of visual effect that is different from thefirst type of visual effect and the second type of visual effect (e.g.,a sticker affordance 1214-2, an avatar affordance 1214-1, an affordanceassociated with a full-screen effect). In some embodiments, a stickeraffordance is associated with a visual effect in which a sticker isdisplayed in the representation of the subject participating in the livevideo communication session. In some embodiments, an avatar affordanceis associated with a visual effect in which a virtual avatar isdisplayed on the representation of the subject participating in the livevideo communication session. In some embodiments, a full-screen effectincludes a visual effect in which graphical objects such as confetti orballoons are displayed in front of, behind, and/or on a participant inthe live video communication session.

In some embodiments, the electronic device (e.g., 600) detects aselection (e.g., 1215) of one of the affordances (e.g., a stickeraffordance 1214-2, an avatar affordance 1214-1, an affordance associatedwith a full-screen effect) associated with a type of visual effect. Insome embodiments, in response to detecting the selection of theaffordance associated with the visual effect, the electronic devicedisplays a plurality of visual effect options (e.g., 1218) correspondingto the visual effect. Displaying a plurality of visual effect optionscorresponding to the visual effect in response to detecting a selectionof the affordance associated with the visual effect allows the user toquickly and easily access corresponding visual effect options. Providingadditional control options without cluttering the UI with additionaldisplayed controls enhances the operability of the device and makes theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, asticker affordance is associated with a visual effect that includesdisplaying a representation of a static graphical object (e.g., a hat, astar, glasses, etc.) in image data (e.g., the representation of thesubject participating in the live video communication session). In someembodiments, an avatar affordance is associated with a visual effectthat includes displaying a representation of a virtual avatar (e.g., acustomizable virtual avatar or a non-customizable virtual avatar) suchthat image data of a person's head is replaced with a graphicalrepresentation of the virtual avatar. In some embodiments, a full-screeneffect includes a visual effect in which graphical objects such asconfetti or balloons are displayed in front of, behind, and/or on aparticipant in the live video communication session.

In response to detecting (1312) the gesture directed to the firstaffordance, the electronic device (e.g., 600) increases (1316) a size ofthe representation (e.g., 1201) of the subject participating in the livevideo communication session. Increasing the size of the representationof the subject participating in the live video communication session inresponse to detecting (1312) the gesture directed to the firstaffordance enables the user to quickly and easily adjust the size of therepresentation of the subject. Providing additional control optionswithout cluttering the UI with additional displayed controls enhancesthe operability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently. In some embodiments, increasing the size of therepresentation of the subject includes switching the position of thedisplayed representation of the subject with the position of a displayedparticipant in the live video communication session.

In some embodiments, while the camera effects mode is activated (1318),the electronic device (e.g., 600) detects (1320) a selection of aneffects option affordance (e.g., a selectable icon associated with afunction for displaying a visual effect in the representation of thesubject participating in the live video communication session). In someembodiments, the effects option affordance is a stickers affordance, anavatar affordance, or an affordance associated with a full-screen effectsuch as confetti or balloons.

In some embodiments, in response to detecting selection of the effectsoption affordance, the electronic device modifies (1322) an appearanceof the representation of the subject participating in the live videocommunication session based on a visual effect (e.g., displaying asticker, avatar, or full-screen effect) associated with the selectedeffects option affordance. Modifying an appearance of the representationof the subject participating in the live video communication sessionbased on a visual effect associated with the selected effects optionaffordance provides visual feedback that application of the selectedvisual effect was successful. Providing improved visual feedback to theuser enhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, in response to detecting selectionof a sticker affordance, the representation of the user participating inthe live video communication session is modified to display a selectedsticker. In some embodiments, in response to detecting selection of anavatar affordance, the representation of the user participating in thelive video communication session is modified to display an avatarpositioned on the face of the user. In some embodiments, in response todetecting selection of an affordance associated with a full-screeneffect, a full-screen effect is displayed in the representation of theuser participating in the live video communication session (e.g.,confetti is displayed falling in front of, behind, and on therepresentation of the user).

In some embodiments, the electronic device (e.g., 600) detects a secondinput on the live video communication user interface (e.g., 1200), thesecond input corresponding to a request to reduce the size of therepresentation of the subject participating in the live videocommunication session. In some embodiments, in response to detecting thesecond input, the electronic device concurrently displays therepresentation of the subject participating in the live videocommunication session having the modified appearance based on the visualeffect associated with the selected effects option affordance and one ormore representations of respective participants in the live videocommunication session. Concurrently displaying the representation of thesubject participating in the live video communication session and one ormore representations of respective participants in the live videocommunication session in response to detecting the second input enablesthe user to quickly and easily view (simultaneously) other participantsof the live video communication. Providing additional control optionswithout cluttering the UI with additional displayed controls enhancesthe operability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently. In some embodiments, the representation of the user in thelive video communication session is reduced so that it is displayed onthe screen with representations of other participants in the live videocommunication session.

In some embodiments, while the camera effects mode is activated (1318),the electronic device (e.g., 600) modifies (1324) an appearance of therepresentation (e.g., 1201) of the subject participating in the livevideo communication session to display one or more visual effects. Insome embodiments, when the visual effect is a sticker effect, theappearance of the representation of the subject participating in thelive video communication session is modified to include display of astatic graphical object (e.g., a sticker). In some embodiments, thestatic graphical object (e.g., sticker) interacts with therepresentation of the subject participating in the live videocommunication session. In some embodiments, when the visual effect is anavatar effect, the appearance of the representation of the subjectparticipating in the live video communication session is modified todisplay a representation of a virtual avatar (e.g., a customizablevirtual avatar or a non-customizable virtual avatar) replacing thesubject's head. In some embodiments, when the visual effect is afull-screen effect, the appearance of the representation of the subjectparticipating in the live video communication session is modified todisplay graphical objects (e.g., graphical confetti or graphicalballoons) displayed in front of, behind, and/or on a participant in thelive video communication session).

In some embodiments, the modified appearance is sent/transmitted toother participants in the live video communication session. In someembodiments, transmitting the data includes transmitting the image data(e.g., a real-time stream of image data) from the field of view of thecamera) along with data (e.g., separate data) representing themodifications made based on the selected visual effect. In someembodiments, transmitting the data includes transmitting composite videodata that includes the image data from the field of view of the cameracombined with data representing the modifications made based on theselected visual effect.

In some embodiments, while the camera effects mode is activated, theelectronic device (e.g., 600) modifies an appearance of therepresentation (e.g., 1200) of the subject participating in the livevideo communication session to display a virtual avatar. In someembodiments, the electronic device detects a change in a face in a fieldof view of one or more cameras (e.g., 602) of the electronic device. Insome embodiments, the electronic device changes an appearance of thevirtual avatar based on the detected change in the face. Changing theappearance of the virtual avatar based on the detected change in theface provides visual feedback that the virtual avatar is basedon/associated with the face. Providing improved visual feedback to theuser enhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, the virtual avatar is modified tomirror movement of the subject participating in the live videocommunication session. In some embodiments, the change in the face isdetected using one or more depth cameras and/or depth maps, as discussedherein.

In some embodiments, while the camera effects mode is activated, theelectronic device (e.g., 600) displays a first visual effect (e.g., asticker) in the representation of the subject participating in the livevideo communication session. In some embodiments, the electronic devicedetects an input (e.g., a touch input) corresponding to the first visualeffect. In some embodiments, in response to detecting the inputcorresponding to the first visual effect, in accordance with adetermination that the input is a first type (e.g., a touch-and-draggesture), the electronic device modifies a location of the first visualeffect in the representation of the subject participating in the livevideo communication session based on a magnitude (e.g., a distance thegesture is moved) and direction of the input. In some embodiments, inresponse to detecting the input corresponding to the first visualeffect, in accordance with a determination that the input is a secondtype (e.g., a pinch or de-pinch gesture), the electronic device modifiesa size of the first visual effect based on the magnitude (e.g., theadjusted distance between the contact points of the pinch/de-pinchgesture) of the input. Modifying the location of the first visual effectin the representation of the subject participating in the live videocommunication session based on a magnitude of the input or modifying thesize of the first visual effect based on the magnitude (e.g., theadjusted distance between the contact points of the pinch/de-pinchgesture) of the input based on the type of the input enables the user toquickly and easily adjust the location or the size of a visual effect(by simply changing the type of the input). Reducing the number ofinputs needed to perform an operation enhances the operability of thedevice and makes the user-device interface more efficient (e.g., byhelping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, modifying the location of the first visual effectincludes one or more of the following steps: prior to detectingtermination of the input (e.g., 889), displaying movement of the firstvisual effect (e.g., 858-4) based on the magnitude and direction of theinput; and in accordance with a determination that the first visualeffect (e.g., a sticker (858-4)) moves across a border region of apredetermined location (e.g., a location corresponding to a portion ofthe representation of the subject (e.g., the subject's face)),generating an indication (e.g., display a bracket (e.g., 890) orgenerate a haptic response (e.g., 892)) that the first visual effect hascrossed (e.g., or is crossing) the border region (e.g., displaying abracket around the representation of the subject's face as shown in FIG.8BG). In some embodiments, when (or after) crossing into the borderregion of the predetermined location, the device displays a bracketaround the predetermined location and/or generates a haptic feedback(e.g., a tactile output) to indicate the sticker is positioned on thesubject's face and will be placed on the subject's face upon terminationof the gesture (e.g., the sticker will have a relationship to therepresentation of the subject's face; e.g., the sticker will have abehavior based on the relationship of the sticker to the user's face).In some embodiments, when (or after) crossing out of the border regionof the predetermined location, the device displays a bracket around thepredetermined location and/or generates a haptic feedback to indicatethe sticker is no longer positioned on the subject's face and will notbe placed on the subject's face upon termination of the gesture (e.g.,the sticker will have a different relationship to the representation ofthe subject's face; e.g., the sticker will have a different behaviorbased on the relationship of the sticker to the user's face; e.g., thesticker will be positioned remote from the subject's face). In someembodiments, the indication is at least one of a haptic feedback (e.g.,a tactile output and/or an audio output) or a visual feedback (e.g., avisual representation of the border region; a bracket displayed aroundthe predetermined location).

Generating an indication that the first visual effect has crossed (or iscrossing) the border region provides visual and/or haptic feedback tothe user that the behavior and placement of the sticker has changed,without requiring the user to terminate the gesture and experiment withthe modeled behavior. Providing improved visual feedback to the userenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently.

In some embodiments, modifying the location of the first visual effectincludes displaying movement of the first visual effect from a firstlocation to a second location. In some embodiments, modifying the sizeof the first visual effect includes displaying a transition of the firstvisual effect from a first displayed size to a second displayed size. Insome embodiments, when a sticker is being moved on the display orresized, the sticker movement and/or resizing is displayed such that theuser and other participants in the live video communication session cansee the changes, including the intermediate movement/resizing as thesticker is being modified.

In some embodiments, modifying the location of the first visual effectincludes transitioning the first visual effect from appearing at a firstlocation to appearing at a second location, without displaying the firstvisual effect appearing at an intermediate location. In someembodiments, modifying the size of the first visual effect includestransitioning the first visual effect from a first displayed size to asecond displayed size, without displaying the first visual effect havingan intermediate size. In some embodiments, when a sticker is being movedon the display or resized, the sticker movement and/or resizing isdisplayed such that only the user can see the changes, including theintermediate movement/resizing as the sticker is being modified, butother participants in the live video communication session cannot seethe intermediate movement/resizing of the sticker. Thus, otherparticipants only see the sticker (or updates to the sticker) after ithas been modified.

In some embodiments, a plurality of participants are participating inthe live video communication session, the plurality of participantsincluding the subject (e.g., a user of the electronic device) and afirst remote participant (e.g., a user of a second electronic device,remote from the first electronic device. In some embodiments, the livevideo communication user interface (e.g., 1200) further includes arepresentation of the first remote participant. In some embodiments, therepresentation of the first remote participant includes image or videodata received from a remote device/a remote camera. In some embodiments,further in response to detecting the gesture directed to the firstaffordance, the electronic device reduces a size of the representationof the first remote participant.

In some embodiments, a plurality of participants are participating inthe live video communication session, the plurality of participantsincluding the subject (e.g., a user of the electronic device) and afirst remote participant (e.g., a user of a second electronic device,remote from the first electronic device), and where the representationof the subject is a live preview of a field of view of a camera (e.g.,602) of the electronic device (e.g., 600) (e.g., a stream of image datathat represents what is in the field of view of the camera). In someembodiments, after modifying the appearance of the representation (e.g.,1201) of the subject participating in the live video communicationsession based on a visual effect associated with the selected effectsoption affordance, the electronic device transmits data corresponding tothe modified appearance of the representation of the subjectparticipating in the live video communication session to at least theremote participant of the plurality of participants. In someembodiments, transmitting the data includes transmitting the image data(e.g., a real-time stream of image data) from the field of view of thecamera) along with data (e.g., separate data) representing themodifications made based on the selected visual effect. In someembodiments, transmitting the data includes transmitting composite videodata that includes the image data from the field of view of the cameracombined with data representing the modifications made based on theselected visual effect.

In some embodiments, the electronic device (e.g., 600) displays the livevideo communication user interface (e.g., 1200) without the firstaffordance. In some embodiments, the electronic device detects a touchinput on the live video communication user interface (e.g., 1206) (e.g.,a tap gesture on the live video communication user interface to displayvideo call options). In some embodiments, in response to detecting thetouch input, in accordance with a determination that the camera effectsmode is activated, the electronic device displays a live videocommunication options user interface including the first affordance anda plurality of visual effects affordances (e.g., a sticker affordance,an avatar affordance, an affordance associated with a full-screeneffect). In some embodiments, in response to detecting the touch input,in accordance with a determination that the camera effects mode is notactivated, the electronic device displays the live video communicationoptions user interface including the first affordance and excluding theplurality of visual effects affordances. Displaying the live videocommunication options user interface including the first affordance andeither including the plurality of visual effects affordances orexcluding the plurality of visual effects affordances based on adetermination that the camera effects mode is or is not activatedindicates whether or not the camera effects mode is currently activated.Performing an operation when a set of conditions has been met withoutrequiring further user input enhances the operability of the device andmakes the user-device interface more efficient (e.g., by helping theuser to provide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

Note that details of the processes described above with respect tomethod 1300 (e.g., FIGS. 13A-13B) are also applicable in an analogousmanner to the methods described above and below. For example, method 700optionally includes one or more of the characteristics of the variousmethods described above with reference to method 1300. For example,visual effects such as stickers and virtual avatars are displayed inimage data in a messaging application user interface. For anotherexample, method 900 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 1300. For example, visual effects such as stickers and virtualavatars are displayed in image data in a camera user interface. Foranother example, method 1100 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 1300. For example, visual effects such as stickers and virtualavatars are displayed in image data in a media user interface. Foranother example, method 1500 optionally includes one or more of thecharacteristics of the various methods described above with reference tomethod 1300. For example, visual effects such as stickers and virtualavatars are displayed in image data for a camera user interface. Forbrevity, these details are not repeated below.

FIGS. 14A-14M illustrate exemplary user interfaces for displaying visualeffects in a camera application, in accordance with some embodiments.The user interfaces in these figures are used to illustrate theprocesses described below, including the processes in FIGS. 15A-15B.

FIG. 14A shows an embodiment of a camera application user interface 1405similar to the camera application user interface 815 discussed abovewith respect to FIGS. 8A-8BQ. Camera application user interface 1405includes image display region 1420 (similar to image display region 820)which displays a representation of image data such as, for example,streamed image data (e.g., a live camera preview, live camera recording,or live video communications session) representing objects positionedwithin a field-of-view of a camera (e.g., a rear-facing camera or camera602), or a media item such as, for example, a photograph or a videorecording. In the embodiment illustrated in FIG. 14A, image displayregion 1420 shows live camera preview 1420-1 from camera 602.

The embodiment illustrated in FIG. 14A is similar to that shown in FIG.8F. Image display region 1420 shows a representation of subject 1422positioned in the field of view of camera 602 and background 1426displayed behind the representation of subject 1422. As discussedherein, image data captured using camera 602 includes, in someembodiments, depth data that can be used to determine a depth of objectsin the field of view of camera 602. In some embodiments, device 600parses objects (e.g., in image data) based on a detected depth of thoseobjects, and uses this determination to apply the visual effectsdiscussed herein. For example, device 600 can categorize representationof subject 1422 as being in the foreground of the live camera preview1420-1 and objects positioned behind the user as being in the backgroundof the live camera preview 1420-1. These background objects are referredto generally herein as background 1426.

In FIGS. 14A and 14B, representation of subject 1422 is shown wearinghat 1423, with a bill that sticks out in front of subject 1422, and noavatar is displayed on the representation of subject 1422. Device 600displays avatar options menu 1408 (similar to avatar options menu 828)having avatar options 1410 that can be selected to display a virtualavatar in image display region 1420.

In FIG. 14B, device 600 detects input 1411 on avatar option 1410-1. Inresponse, device 600 displays, in FIG. 14C, avatar option 1410-1 inselection region 1429 and generates haptic feedback 1412 (e.g., atactile output, with or without an audio output) to indicate that avataroption 1410-1 is selected.

As shown in FIG. 14C, device 600 also updates live camera preview 1420-1to display virtual avatar 1421 corresponding to selected avatar option1410-1. Avatar 1421 is displayed overlaying portions of therepresentation of subject 1422, including subject's hat 1423, whilemaintaining display of other portions of representation of subject 1422and background 1426. As shown in FIGS. 14A-14K, device 600 dynamicallymodifies avatar 1421 to hide or display certain portions of avatar 1421depending on a spatial relationship determined for representation ofsubject 1422 and avatar 1421 (e.g., based on detected movements of thesubject). This gives the appearance of avatar 1421 being physicallyplaced onto the head of representation of subject 1422.

In some embodiments, the dynamic modification of avatar 1421 is achievedusing one or more depth sensors (e.g., depth camera sensor 175) tocapture an initial depth map of the objects in the field of view ofcamera 602 (including the subject (corresponding to representation ofsubject 1422) and background (corresponding to background 1426)). Theinitial depth map is then modified (e.g., using one or more of ablurring, fading, or smoothing transition of the initial depth map) todecrease instances of abrupt transitions between displaying and hidingportions of the avatar. This provides a more fluid, dynamic appearanceof avatar 1421, particularly as various portions of the avatar arehidden or displayed in response to movement of the subject.

In FIG. 14C, avatar 1421 includes shadow 1430 positioned under theavatar head and displayed on the representation of the subject's neck,to represent a shadow cast by the presence of avatar 1421 onrepresentation of subject 1422. In some embodiments, a position ofshadow 1430 is determined based on a shape of the avatar and a relativeposition of representation of subject 1422, avatar 1421, and a lightsource (e.g., a light source detected in the field of view of camera 602or a simulated light source).

Device 600 displays avatar 1421 having long hair that hangs in front of,and behind, the representation of the subject's shoulders. The positionof certain portions of the hair, relative to the representation of thesubject's shoulders, is determined based on depth data that indicatesthe spatial positioning of avatar 1421 (including the avatar hair)relative to the depth position of representation of subject 1422 (andspecific portions of representation of subject 1422 (e.g.,representations of the subject's neck and/or shoulders)). In someembodiments, portions of the avatar that are dynamically displayed(e.g., portions of the avatar that can be either displayed or hiddendepending on the spatial relationship with representation of subject1422) are shown having a blending effect 1432 at locations adjacent torepresentation of subject 1422. This blending effect smooths a displayedtransition between the portion of the avatar and the representation ofsubject 1422.

Device 600 modifies avatar 1421 in response to detected changes in thehead and face of the subject. For example, as shown in FIGS. 14D-14F,device 600 detects the subject's head turned to the side, while winkingand smiling and, in response, modifies avatar 1421 to reflect these samemovements (e.g., avatar 1421 is shown with the avatar head turned to theside with a wink and smile), while also adjusting the position of shadow1430 based on the movement of the avatar. Movement of the avatar headaffects the spatial position of the avatar hair relative torepresentation of subject 1422. As a result, device 600 dynamicallymodifies some portions of the avatar based on the changed spatialrelationship. For example, as the head turns, device 600 displays someof the avatar hair moving back over the representation of the subject'sshoulder and/or behind the representation of the subject's neck (e.g.,device 600 hides some portions of the avatar hair), as indicated atreference number 1424-1. Conversely, as the avatar head turns, device600 also displays (e.g., reveals) portions of the avatar hair that werepreviously hidden prior to turning the avatar head (e.g., they werepreviously hidden behind the representation of the subject's neck orshoulder), as indicated at reference number 1424-2. Thus, device 600hides or displays dynamic portions of the avatar (such as portions ofavatar hair) based on the change in position of the user, whichcorrespondingly affects movement of the avatar.

In some embodiments, portions of avatar 1421 are persistently displayedregardless of any spatial relationship to representation of subject 1422or any other objects in the depth map. For example, althoughrepresentation of subject 1422 is wearing hat 1423, which includes abill that sticks out in front of representation of subject 1422, avatarhead 1421-3 and avatar face 1421-4 are persistently displayed in frontof the representation of the subject's head and hat 1423. This preventsobjects in the field of view of camera 602, particularly objects onrepresentation of subject 1422 (or portions of the representation of thesubject), from appearing through portions of avatar 1421 (e.g.,specifically, portions of avatar 1421 that should always be displayed torender an appearance of avatar 1421 positioned on representation ofsubject 1422). In some embodiments, the persistently displayed portionsof avatar 1421 can include the avatar's face (1421-4), head (1421-3),and portions of the avatar's hair (1421-1).

As another example, FIG. 14D shows a representation of subject's hand1425 held out in front of representation of subject 1422 towards camera602. Although representation of hand 1425 is positioned far in front ofrepresentation of subject 1422, portion 1421-1 of avatar hair isdisplayed over of a portion of representation of hand 1425, even thoughthe spatial position of representation of hand 1425 is clearly closer tocamera 602 than the avatar hair portion 1421-1. To be clear, if avatarhair portion 1421-1 was dynamic (e.g., not persistently displayed), itwould be hidden behind representation of hand 1425 in FIG. 14D similarto the dynamic portions of the avatar hair (e.g., 1421-2), which arediscussed below.

FIG. 14E shows representation of hand 1425 moved back towards subject1422, close to the representation of the subject's shoulder. Asrepresentation of hand 1425 moves closer to representation of subject1422, device 600 determines that the spatial position of representationof hand 1425 moves into the dynamic portion of avatar hair. As a result,device 600 displays additional avatar hair 1421-2 positioned in front ofrepresentation of hand 1425. Similarly, FIG. 14F shows representation ofhand 1425 positioned even closer towards representation of subject 1422and a greater amount of hair displayed in front of representation ofhand 1425. In FIGS. 14D-14F, the amount of hair positioned in front ofrepresentation of subject's right shoulder 1422-1 does not changebecause the spatial relationship of the avatar hair and representationof subject's right shoulder 1422-1 does not change after the head isturned.

In FIG. 14G, the subject returns to a forward facing position with aneutral facial expression, and device 600 modifies avatar 1421accordingly.

In FIG. 14H, the subject looks up, and device 600 modifies avatar 1421accordingly. Device 600 shows avatar 1421 with a head tilting up toreveal the underside of the avatar's head, which includes blendingeffect 1434 at a location where the representation of the subject's neckintersects avatar 1421. In some embodiments, blending effect 1434 issimilar to blending effect 1432. In FIG. 14H, shadow 1430 is alsoadjusted (e.g., moved under the avatar's chin) based on the tiltedposition of the avatar's head. When avatar 1421 is looking up, avatarhair positioned behind the representation of the subject's neck remainshidden behind the representation of the subject's neck and does notprotrude through the representation of the subject's neck.

FIGS. 141-14J illustrate selecting a different avatar in accordance withan embodiment of the present disclosure.

In FIGS. 14K-14M, device 600 shows representation of subject 1422 withunicorn avatar 1435 positioned over the representation of the subject'shead and neck. Unicorn avatar 1435 includes a head portion 1435-1 and aneck portion 1435-2 including mane 1435-3. Neck portion 1435-2 isdisplayed around the representation of the subject's neck so that therepresentation of the subject's neck is not displayed. Blending effect1438 is displayed where the unicorn neck portion 1435-2 meets the baseof the representation of the subject's neck. Blending effect 1438 issimilar to the other blending effects discussed herein.

Unicorn avatar 1435 also includes shadow 1436 displayed on a portion ofunicorn avatar 1435 and on representation of subject 1432 (e.g., arepresentation of a shadow cast onto the representation of the subjectby the avatar). In some embodiments, a displayed shadow has a shape andposition determined based on the shape of the avatar and a relativeposition of the avatar and representation of subject 1432 to a lightsource (e.g., a light source detected in the field of view of camera 602or a simulated light source). As shown in FIGS. 14K-14M, shadow 1436moves in response to movement of unicorn avatar 1435. In FIG. 14K,shadow 1436 has an elongated shape (due to the elongated shape of theunicorn avatar's face) and is positioned on neck portion 1435-2 and thechest of representation of subject 1422. In FIG. 14L, head portion1435-1 is turned to the side to reveal additional neck portion 1435-2and mane 1435-3. A portion 1435-4 of mane 1435-3 that was positioned onthe backside of unicorn avatar 1435 in FIG. 14K is now shown positionedin front of subject's shoulder 1422-1 and avatar neck portion 1435-2.Shadow 1436 is moved to the side to be positioned under head portion1435-1 (e.g., on neck portion 1435-2), and partly on the shoulder ofrepresentation of subject 1422. In FIG. 14M, head portion 1435-1 isfacing forward and tilted up, and shadow 1436 is positioned under headportion 1435-1 (on neck portion 1435-2) and having a reduced size due tothe upward tilt of head portion 1435-1. Additionally, when the head istilted up, the mane 1435-3 remains positioned on the backside of theneck portion 1435-2 and is not displayed through the neck portion.

FIGS. 15A-15B are a flow diagram illustrating a method for displayingvisual effects in a camera application using an electronic device inaccordance with some embodiments. Method 1500 is performed at a device(e.g., 100, 300, 500, 600) with one or more cameras and a displayapparatus. Some operations in method 1500 are, optionally, combined, theorders of some operations are, optionally, changed, and some operationsare, optionally, omitted.

As described below, method 1500 provides an intuitive way for displayingvisual effects in a camera application. The method reduces the cognitiveburden on a user for applying visual effects to an image viewed in acamera application, thereby creating a more efficient human-machineinterface. For battery-operated computing devices, enabling a user todisplay visual effects in an image faster and more efficiently conservespower and increases the time between battery charges.

The electronic device (e.g., 600) displays (1502), via the displayapparatus (e.g., 601), a representation of image data (e.g., 1420-1)captured via the one or more cameras (e.g., 602). In some embodiments,the representation includes a representation of a subject (e.g., 1422)(e.g., a representation of at least a portion of a subject) and theimage data corresponds to depth data (e.g., the image data includes datacaptured by a visible light camera and a depth camera) that includesdepth data for the subject (e.g., information about the relative depthpositioning of one or more portions of the subject with respect to otherportions of the subject and/or to other objects within the field of viewof the one or more cameras). In some embodiments, depth data is in theform of a depth map or depth mask.

In some embodiments, the electronic device (e.g., 600) includes one ormore depth sensors (e.g., 175, 602). In some embodiments, prior todisplaying a representation of the virtual avatar (e.g., 1421), theelectronic device captures initial depth data (e.g., an initial orunmodified depth map and/or depth mask corresponding to the image datacaptured by the one or more cameras (e.g., 602); an initial orunmodified depth mask of the subject) for the subject via the one ormore depth sensors. The electronic device generates the depth data forthe subject by modifying the initial depth data for the subject. In someembodiments, modifying the initial depth data can decrease instances ofabrupt transitions between including and excluding the representation ofthe first portion of the virtual avatar (e.g., 1421-2), particularly asthe pose of the subject changes with respect to the electronic device.Modifying the initial depth data for the subject allows for smoothertransitions in displaying the representation of the virtual avatar asthe pose of the user changes, thereby improving the visual feedback ofdetected changes in the subject (represented by the correspondingchanges to the virtual avatar). Providing improved visual feedback tothe user enhances the operability of the device and makes theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, modifying the initial depth data for the subjectincludes performing one or more transformations on the initial depthdata selected from the group consisting of blurring (e.g., defocusingthe initial depth data to blend the transitions between different levelsof the depth data; e.g., blurring the values (e.g., greyscale values) ofan initial depth mask) the initial depth data, fading out (e.g.,modulating the depth data downwards to reduce the depth values) to theinitial depth data, and smoothing (e.g., applying a mathematicalfunction to blend the initial depth data, particularly at thetransitions between different depth layers of the initial depth data)the initial depth data.

The electronic device (e.g., 600) displays (1504), via the displayapparatus (e.g., 601), a representation of a virtual avatar (e.g., 1421)(e.g., a visual representation of a virtual avatar construct that caninclude some or all of the construct, when represented) that isdisplayed in place of (e.g., occludes or is displayed on top of) atleast a portion of (e.g., with at least a portion of the virtual avatarpartially or completely overlaying (e.g., obscuring) at least a portionof the subject) the representation of the subject (e.g., 1422, 1423).Displaying a visual representation of the virtual avatar over at least aportion of the representation of the subject provides the user withvisual feedback of how the virtual avatar looks when overlaid thesubject. Providing improved visual feedback to the user enhances theoperability of the device and makes the user-device interface moreefficient (e.g., by helping the user to provide proper inputs andreducing user mistakes when operating/interacting with the device)which, additionally, reduces power usage and improves battery life ofthe device by enabling the user to use the device more quickly andefficiently. In some embodiments, the virtual avatar is placed at asimulated depth (e.g., at a location selected so that the virtual avataris displayed slightly in front of the representation of the subject in adepth dimension of the user interface) relative to the representation ofthe subject as determined based on the depth data for the subject.

In some embodiments, in accordance with a determination, based on thedepth data, that a first portion of the virtual avatar (e.g., 1421-2)(e.g., an avatar hair portion) satisfies a set of depth-based displaycriteria, device (e.g., 600) includes (1506) as part of therepresentation of the virtual avatar (e.g., 1421), a representation ofthe first portion of the virtual avatar (e.g., 1421-2) that is displayedin place of the first portion of the subject (e.g., the first portion ofthe representation of the subject) (for example, a portion 1421-2 ofavatar hair is displayed over a portion of a representation of subject'shand 1425 as shown in FIG. 14E). When a first portion of the virtualavatar satisfies depth-based display criteria because it is not obscuredby a portion of the subject, the first portion of the virtual avatar isdisplayed. Determining whether the first portion of the virtual avataris obscured prior to displaying it allows the user-device interface tobe more efficient by only displaying portions of the virtual avatar thatwill be visible to the user. Providing visual feedback of the visibleportion of the virtual avatar allows the user to see the resulting imagewhen the virtual avatar is overlaid over the subject. Providing improvedvisual feedback to the user enhances the operability of the device andmakes the user-device interface more efficient (e.g., by helping theuser to provide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, thedepth-based display criteria include a requirement that the depth datafor the subject indicate that the first portion of the virtual avatarhas a simulated depth that is in front of a corresponding first portion(e.g., a neck, shoulders and/or body) of the subject (e.g., acorresponding first portion of the representation of the subject), inorder for the depth-based display criteria to be met (e.g., the set ofdepth-based criteria includes a criterion that is satisfied when theportion of the virtual avatar is to be displayed at a position that isnot obscured by a portion of the subject (e.g., an ear of the subject)).

In some embodiments, in accordance with a determination, based on thedepth data, that the first portion of the virtual avatar (e.g., 1421-2)does not satisfy the set of depth-based display criteria for the firstportion of the subject (e.g., 1425) (e.g., because the depth data forthe subject indicate that the first portion of the virtual avatar has asimulated depth that is behind the corresponding first portion of thesubject), the device (e.g., 600) excludes (1508), from therepresentation of the virtual avatar (e.g., 1421), the representation ofthe first portion of the virtual avatar (e.g., hair is not displayedbecause it is positioned behind the subject's shoulder 1422-1 at region1424-1) (e.g., additional avatar hair 1421-2 is not shown in FIG. 14Dbecause it is positioned behind hand 1425). In some embodiments, theelectronic device also displays the first portion of the subject (e.g.,the first portion of the representation of the subject) in the regionthat would have been occupied by the first portion of the virtual avatar(e.g., forgo including as part of the representation of the virtualavatar, a representation of the first portion of the virtual avatarbecause that portion should be obscured by the subject). When the firstportion of the virtual avatar does not satisfy depth-based displaycriteria because it is obscured by a portion of the subject, the firstportion of the virtual avatar is excluded from the displayedrepresentation of the virtual avatar. Determining whether the firstportion of the virtual avatar is obscured prior to displaying it allowsthe user-device interface to be more efficient by excluding portions ofthe virtual avatar that will not be visible to the user. Providingvisual feedback of the virtual avatar allows the user to see theresulting image when the virtual avatar is overlaid over the subject.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, the first portion of the virtual avatar (e.g.,1421) (e.g., an avatar head) moves based on movement of the subject. Insome embodiments, the first portion of the virtual avatar moves based onthe movement of the subject's head or the representation of thesubject's head.

In some embodiments, the representation of the virtual avatar includes arepresentation of a second portion (e.g., 1421-1, 1421-3, 1421-4,1435-1) (e.g., a top of an avatar head (1421-3)) of the virtual avatarthat is displayed over a second portion (e.g., 1425, 1423) of therepresentation of the subject without regard to whether or not the depthdata indicate that the second portion of the virtual avatar has asimulated depth that is in front of or behind the corresponding secondportion of the representation of the subject. A representation of asecond portion of the virtual avatar such as the top of the avatar'shead is persistently displayed. This allows the second portion of thevirtual avatar to always be displayed, even if the representation of thesubject includes an object that is positioned closer to the camera(e.g., 602) than the avatar (e.g., a hat (1423) positioned on therepresentation of the subject's head will be covered by the avatar(1421)). Persistently displaying a portion of the virtual avatarprovides the user with more control of the device by allowing the userto display a selected avatar without having to adjust depth settings ofthe device to ignore certain objects. Providing additional control ofthe device without cluttering the UI with additional displayed controlsenhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, the second portion of the virtualavatar (e.g., an avatar head) is persistently displayed in place of thecorresponding second portion of the representation of the subject. Insome embodiments, portions of the avatar that are displayed irrespectiveof depth-based criteria are persistently displayed over the subject toavoid displaying portions of the representation of the subject (e.g., ahat (1423), the subject's hair) protruding through the virtual avatar,even when a spatial relationship of the portions of the virtual avatarand the portions of the representation of the subject would otherwiseindicate that the portions of the virtual avatar should be obscured bythe portions of the subject. In some embodiments, the second portion ofthe virtual avatar (e.g., an avatar head) moves based on movement of thesubject (e.g., 1422) (e.g., based on movement of the subject's head orbased on movement of the representation of the subject's head).

In some embodiments, the first portion of the virtual avatar (e.g.,1421-2) (e.g., a portion that is included or excluded based on depthdata) is a first sub-portion of a first avatar feature (e.g., an elementof the virtual avatar such as avatar hair, an avatar ear, an avataraccessory (e.g., avatar earrings)) and the second portion (e.g., 1421-1,1421-3, 1421-4, 1435-1, 1435-2) of the virtual avatar (e.g., a portionthat is not included or excluded based on depth data; that is includedindependent of the depth data) is a second sub-portion of the firstavatar feature (e.g., avatar hair). In some embodiments, the firstsub-portion is a portion (e.g., 1435-3) of the virtual avatar (e.g.,1435) that is positioned on the backside of the virtual avatar when thevirtual avatar is in a neutral position (e.g., as shown in FIG. 14K)(e.g., a displayed position of the virtual avatar when no changes aredetected in the pose of the subject's face or head) facing forward(e.g., facing the camera). In some embodiments, the second sub-portion(e.g., 1435-2) is a portion of the virtual avatar that is positioned onthe front side of the virtual avatar or is otherwise not positioned onthe backside of the virtual avatar when the virtual avatar is in aneutral position facing forward. In some embodiments, the first avatarfeature is avatar hair, the second sub-portion is a front portion of theavatar hair (e.g., 1421-1) (e.g., bangs) displayed on the front of thevirtual avatar's head (e.g., 1421-3), and the first sub-portion is aback portion of the avatar hair (e.g., long hair in the back) that is atleast partially obscured by a portion of the subject (e.g., 1422), suchas the subject's neck and/or shoulders (e.g., 1422-1).

In some embodiments, the virtual avatar (e.g., 1421, 1435) includes anavatar hair feature (e.g., avatar hair that is long) that includes thefirst portion (e.g., 1421-1) of the virtual avatar. The electronicdevice (e.g., 600) displays the representation of the virtual avatar bydisplaying a first portion of the avatar hair feature (e.g., 1421-1) andconditionally displays a second portion of the avatar hair feature(e.g., 1421-2) based on whether or not a simulated depth of the secondportion of the avatar hair feature is in front of or behind a thirdportion of the representation of the subject (e.g., 1422-1) (e.g., neck,shoulders, and/or body) based on the depth data for the subject (e.g.,displaying a representation of a persistent portion of the avatar hairfeature and, variably including (or excluding) depending on depth, thefirst portion of the virtual avatar). Determining whether the firstportion of the avatar hair feature and a second portion of the hairfeature if it is in front of or behind a third portion of therepresentation of the subject, such as the neck, shoulders or body ofthe subject. Determining the visibility of the second portion of thehair feature prior to displaying allows the user-device interface to bemore efficient in only displaying portions of the avatar hair featurethat will be visible to the user. Providing visual feedback of thevirtual avatar allows the user to see the resulting image when theavatar hair feature is displayed with the representation of the subject.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, the virtual avatar includes an avatar neck feature(e.g., 1435-3) (e.g., a neck of an equine avatar (e.g., unicorn orhorse); e.g., an avatar mane) that includes the first portion (e.g.,1435-4) of the virtual avatar. The electronic device displays therepresentation of the virtual avatar (e.g., 1435) by conditionallydisplaying a portion of the avatar neck feature based on whether or nota simulated depth of the portion of the avatar neck feature is in frontof or behind a fourth portion (e.g., 1422-1) of the representation ofthe subject (e.g., neck or shoulder) based on the depth data for thesubject (e.g., displaying a representation of a persistent portion ofthe avatar neck feature and, variably including (or excluding) dependingon depth, the first portion of the virtual avatar). Determining whetherthe portion of the avatar neck feature if it is in front of or behind afourth portion of the representation of the subject, such as the neck ofthe subject. Determining the visibility of the portion of the neckfeature prior to displaying allows the user-device interface to be moreefficient in only displaying portions of the avatar neck feature thatwill be visible to the user. Providing visual feedback of the virtualavatar allows the user to see the resulting image when the avatar hairfeature is displayed with the representation of the subject. Providingimproved visual feedback to the user enhances the operability of thedevice and makes the user-device interface more efficient (e.g., byhelping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, the first portion (e.g., 1421-2, 1435-3) of thevirtual avatar (e.g., 1421, 1435) includes an obscured portion (e.g.,1435-4) of the virtual avatar (e.g., the back of the avatar's neck) thatis not displayed when the portion of the representation of the subject(e.g., the subject's head) has a pose that (directly) faces the one ormore cameras (e.g., the subject's head is positioned forward, facing thecamera). Obscured portions of the avatar are not displayed because theuser would not be able to see that portion of the avatar. Determiningthe visibility of the first portion of virtual avatar prior todisplaying allows the user-device interface to be more efficient in onlydisplaying portions of the avatar neck feature that will be visible tothe user. Providing visual feedback of the virtual avatar allows theuser to see the resulting image when the avatar hair feature isdisplayed with the representation of the subject. Providing improvedvisual feedback to the user enhances the operability of the device andmakes the user-device interface more efficient (e.g., by helping theuser to provide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, theobscured portion of the virtual avatar includes the back of the avatar'sneck (e.g., 1435-2, 1435-3) or portions of the virtual avatar (e.g.,back of avatar hair) that are positioned behind the subject's neck orhead. In some embodiments, this prevents the back of the avatar's neck(or portions of the avatar on the back of the avatar head or positionedbehind the subject's neck) from being displayed protruding through therepresentation of the subject's neck when the subject's head is tiltedup (e.g., looking up).

In some embodiments, displaying the representation of the virtual avatar(e.g., 1421) further includes modifying the visual appearance (e.g.,blending, blurring, feathering, or otherwise gradually changing thedegree of hiding) of a third portion (e.g., 1432) of the virtual avatarthat is adjacent to the first portion (e.g., 1421-1) of the virtualavatar (e.g., and also adjacent at least a portion of the representationof the subject (e.g., 1422-1)) to an appearance that is based on boththe appearance of the avatar and the appearance of the representation ofthe subject. In some embodiments, a portion of avatar hair (e.g., 1432)is blended with the representation of the subject (e.g., 1422) at aportion of the representation of the virtual avatar where the portion ofthe avatar hair intersects the shoulders (1422-1) of the displayedrepresentation of the subject. In some embodiments, a bottom portion ofthe avatar head is blended (e.g., 1434) with the representation of thesubject at a portion of the representation of the virtual avatar wherethe bottom portion of the avatar head intersects the displayedrepresentation of the subject's neck (e.g., FIG. 14H).

In some embodiments, the electronic device (e.g., 600) detects a changein pose of a head portion of the subject (e.g., 1422) (e.g., thesubject's head turns to the side). In response to the electronic devicedetecting the change in pose of the head portion of the subject, theelectronic device modifies (e.g., increasing or decreasing), based onthe depth data and the change in pose, an amount of the virtual avatar(e.g., 1421-2) that is excluded from (e.g., a size that is eitherincluded or excluded from the representation of the virtual avatar) therepresentation of the virtual avatar (e.g., the avatar's hair). Updatingthe displayed virtual avatar based on changes in the pose of the headportion of the subject provides visual feedback of the virtual avatar.Providing improved visual feedback to the user enhances the operabilityof the device and makes the user-device interface more efficient (e.g.,by helping the user to provide proper inputs and reducing user mistakeswhen operating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments,modification includes increasing or decreasing an amount of the avatar'shair that is displayed in the representation of the virtual avatar(e.g., 1421) when the avatar's head (e.g., 1421-3) is turned to the sideto match the movement of the subject's head. In some embodiments, thedisplayed amount of the first portion of the avatar (e.g., the avatar'shair) is modified depending on whether the portion of the avatar isobscured by a portion of the representation of the subject in responseto the change in pose. For example, a displayed amount (e.g., size) of aportion of the avatar's hair is decreased when the portion of the avatarhair is obscured by the user's neck or shoulders (e.g., 1422-1) inresponse to turning the avatar's head (e.g., turning the avatar's headcauses previously displayed hair positioned in front of the subject'sshoulders to no longer be displayed because turning the head positionedthe avatar hair behind the subject's shoulders (e.g., 1424-1)).Additionally, a displayed amount of the portion of the avatar's hairincreases when the portion of the hair (e.g., 1424-2) that waspreviously hidden behind the subject's shoulders, neck, or head isvisible as a result of the avatar head turning to the side (e.g., hairpositioned behind the subject's shoulders is now visible because turningthe avatar's head caused the avatar hair to be positioned in front ofthe subject's shoulders).

In some embodiments, the device (e.g., 600) detects (1510) a change inpose of the subject. In response to detecting the change in pose of thesubject (e.g., 1422) (e.g., detecting a movement of a hand (e.g., 1425)over the user's shoulder (e.g., 1422-1); e.g., detecting a turning ortilting of the subject's head), the electronic device (e.g., 600)modifies (1512) the displayed representation of the virtual avatar(e.g., 1421) based on the change in pose. Updating the displayed virtualavatar based on changes in the pose of the subject provides visualfeedback of the virtual avatar. Providing improved visual feedback tothe user enhances the operability of the device and makes theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently. In some embodiments, inaccordance with a determination, based on the depth data that takes intoaccount the change in pose of the subject, that the first portion (e.g.,1421-2) of the virtual avatar (e.g., 1421) satisfies the set ofdepth-based display criteria, the electronic device updates (1514) anappearance of the representation of the virtual avatar from a firstappearance (e.g., FIG. 14D) that excludes the first portion (e.g.,1421-2) of the virtual avatar (e.g., an appearance in which the firstportion of the virtual avatar is not displayed because it is obscured bythe representation of the subject (e.g., 1425) (e.g., a portion of theavatar's hair is hidden behind the subject's hand, neck, and/orshoulders) to a second appearance (e.g., FIG. 14E) that includes thefirst portion of the virtual avatar. In some embodiments, in accordancewith a determination, based on the depth data that takes into accountthe change in pose of the subject, that the first portion of the virtualavatar does not satisfy the set of depth-based display criteria, theelectronic device updates (1516) the appearance of the representation ofthe virtual avatar from a third appearance (e.g., FIG. 14C) thatincludes the first portion of the virtual avatar to a fourth appearance(e.g., FIG. 14D) that excludes the first portion of the virtual avatar(e.g., 1424-1 shows a reduction in the amount of displayed hair). Insome embodiments, the first portion of the virtual avatar is dynamicallyhidden based on the subject's movement (e.g., a hidden portion of theavatar can become displayed, and a displayed portion of the avatar canbecome hidden, based on the subject's movement).

In some embodiments, the electronic device (e.g., 600) detects a change(e.g., a change in pose (e.g., orientation, rotation, translation,etc.); e.g., a change in a facial expression) in the portion of therepresentation of the subject (e.g., 1422). The electronic devicechanges an appearance of the virtual avatar (e.g., 1421, 1435) based onthe detected change in the portion of the representation of the subject(e.g., modifying, in real time, a position and/or facial expression ofthe virtual avatar based on the detected change in the portion of therepresentation of the subject). Updating the displayed virtual avatarbased on changes in the expressions of the subject provides visualfeedback of the virtual avatar. Providing improved visual feedback tothe user enhances the operability of the device and makes theuser-device interface more efficient (e.g., by helping the user toprovide proper inputs and reducing user mistakes whenoperating/interacting with the device) which, additionally, reducespower usage and improves battery life of the device by enabling the userto use the device more quickly and efficiently.

In some embodiments, the electronic device (e.g., 600) displays, via thedisplay apparatus (e.g., 601), a representation of a shadow (e.g., 1430,1436) cast by the virtual avatar (e.g., 1421, 1435) that is displayed onat least a fifth portion (e.g., the subject's chest, neck, or shoulder)of the representation of the subject. The device displays arepresentation of a shadow cast by the virtual representation over aportion of the representation of the subject to provide a more realisticrepresentation of the displayed virtual avatar with a simulated lightsource. Providing visual feedback of the virtual avatar allows the userto see the resulting image. Providing improved visual feedback to theuser enhances the operability of the device and makes the user-deviceinterface more efficient (e.g., by helping the user to provide properinputs and reducing user mistakes when operating/interacting with thedevice) which, additionally, reduces power usage and improves batterylife of the device by enabling the user to use the device more quicklyand efficiently. In some embodiments, the representation of the shadowcast by the virtual avatar is overlaid on the representation of thesubject with an opacity less than 100%. In some embodiments, a portionof the subject that is determined based on a relative position of thedisplayed virtual avatar and a simulated light source that is,optionally, determined based on a detected light source in the field ofview of the camera. In some embodiments, one or more characteristics(e.g., position, intensity, shape, etc.) of the displayed representationof the shadow are based on a shape of the virtual avatar. In someembodiments, a shape of the displayed shadow is determined based on theshape of the virtual avatar such that different avatars appear to castshadows of different shapes.

In some embodiments, one or more characteristics (e.g., position,intensity, shape, etc.) of the displayed representation of the shadow(e.g., 1430, 1436) are based on a lighting condition (e.g., a detectedamount of ambient light, a detected light source, etc.) in the field ofview of the one or more cameras (e.g., 602). In some embodiments, theposition of the shadow is determined based on a position of a lightsource in the field of view of the camera. For example, if a lightsource (e.g., a detected light source or a modeled light source) ispositioned behind the representation of the subject (e.g., 1422) in thefield of view of the camera, the shadow is positioned on therepresentation of the subject opposite from the position of the lightsource relative to the representation of the subject. In someembodiments, the intensity of the shadow is determined based on thebrightness of the lighting conditions detected in the field of view ofthe one or more cameras (e.g., the shadow is more intense (distinct,darker, etc.) for brighter lighting conditions, and less intense fordarker lighting conditions).

In some embodiments, one or more characteristics (e.g., position,intensity, shape, etc.) of the displayed representation of the shadow(e.g., 1430, 1436) are based on the depth data. In some embodiments, theposition and/or shape of the shadow is determined using the depth data(e.g., in the form of a depth map or depth mask) to provide a morerealistic representation of the shadow effect that is based on thethree-dimensional positioning of the representation of the subject(e.g., 1422) in the field of view of the one or more cameras (e.g., sothat the shadow of the avatar appears to fall onto the subject based ona simulated distance from the avatar to the subject and a simulateddistance from the light source to the avatar).

Note that details of the processes described above with respect tomethod 1500 (e.g., FIGS. 15A-15B) are also applicable in an analogousmanner to the methods described above. For example, method 700optionally includes one or more of the characteristics of the variousmethods described above with reference to method 1500. For example,visual effects such as virtual avatars are displayed in image data in amessaging application user interface. For another example, method 900optionally includes one or more of the characteristics of the variousmethods described above with reference to method 1500. For example,visual effects such as virtual avatars are displayed in image data in acamera user interface. For another example, method 1100 optionallyincludes one or more of the characteristics of the various methodsdescribed above with reference to method 1500. For example, visualeffects such as virtual avatars are displayed in image data in a mediauser interface. For another example, method 1300 optionally includes oneor more of the characteristics of the various methods described abovewith reference to method 1500. For example, visual effects such asvirtual avatars are displayed in image data in a user interface for alive video communication session. For brevity, these details are notrepeated below.

The foregoing description, for purpose of explanation, has beendescribed with reference to specific embodiments. However, theillustrative discussions above are not intended to be exhaustive or tolimit the invention to the precise forms disclosed. Many modificationsand variations are possible in view of the above teachings. Theembodiments were chosen and described in order to best explain theprinciples of the techniques and their practical applications. Othersskilled in the art are thereby enabled to best utilize the techniquesand various embodiments with various modifications as are suited to theparticular use contemplated.

Although the disclosure and examples have been fully described withreference to the accompanying drawings, it is to be noted that variouschanges and modifications will become apparent to those skilled in theart. Such changes and modifications are to be understood as beingincluded within the scope of the disclosure and examples as defined bythe claims.

As described above, one aspect of the present technology is thegathering and use of data available from various sources for sharingwith other users. The present disclosure contemplates that in someinstances, this gathered data may include personal information data thatuniquely identifies or can be used to contact or locate a specificperson. Such personal information data can include demographic data,location-based data, telephone numbers, email addresses, twitter IDs,home addresses, data or records relating to a user's health or level offitness (e.g., vital signs measurements, medication information,exercise information), date of birth, or any other identifying orpersonal information.

The present disclosure recognizes that the use of such personalinformation data, in the present technology, can be used to the benefitof users. For example, the personal information data can be used tobetter represent a user in a conversation. Further, other uses forpersonal information data that benefit the user are also contemplated bythe present disclosure. For instance, health and fitness data may beused to provide insights into a user's general wellness, or may be usedas positive feedback to individuals using technology to pursue wellnessgoals.

The present disclosure contemplates that the entities responsible forthe collection, analysis, disclosure, transfer, storage, or other use ofsuch personal information data will comply with well-established privacypolicies and/or privacy practices. In particular, such entities shouldimplement and consistently use privacy policies and practices that aregenerally recognized as meeting or exceeding industry or governmentalrequirements for maintaining personal information data private andsecure. Such policies should be easily accessible by users, and shouldbe updated as the collection and/or use of data changes. Personalinformation from users should be collected for legitimate and reasonableuses of the entity and not shared or sold outside of those legitimateuses. Further, such collection/sharing should occur after receiving theinformed consent of the users. Additionally, such entities shouldconsider taking any needed steps for safeguarding and securing access tosuch personal information data and ensuring that others with access tothe personal information data adhere to their privacy policies andprocedures. Further, such entities can subject themselves to evaluationby third parties to certify their adherence to widely accepted privacypolicies and practices. In addition, policies and practices should beadapted for the particular types of personal information data beingcollected and/or accessed and adapted to applicable laws and standards,including jurisdiction-specific considerations. For instance, in the US,collection of or access to certain health data may be governed byfederal and/or state laws, such as the Health Insurance Portability andAccountability Act (HIPAA); whereas health data in other countries maybe subject to other regulations and policies and should be handledaccordingly. Hence different privacy practices should be maintained fordifferent personal data types in each country.

Despite the foregoing, the present disclosure also contemplatesembodiments in which users selectively block the use of, or access to,personal information data. That is, the present disclosure contemplatesthat hardware and/or software elements can be provided to prevent orblock access to such personal information data. For example, in the caseof sending an avatar, the present technology can be configured to allowusers to select to “opt in” or “opt out” of participation in thecollection of personal information data during registration for servicesor anytime thereafter. In addition to providing “opt in” and “opt out”options, the present disclosure contemplates providing notificationsrelating to the access or use of personal information. For instance, auser may be notified upon downloading an app that their personalinformation data will be accessed and then reminded again just beforepersonal information data is accessed by the app.

Moreover, it is the intent of the present disclosure that personalinformation data should be managed and handled in a way to minimizerisks of unintentional or unauthorized access or use. Risk can beminimized by limiting the collection of data and deleting data once itis no longer needed. In addition, and when applicable, including incertain health related applications, data de-identification can be usedto protect a user's privacy. De-identification may be facilitated, whenappropriate, by removing specific identifiers (e.g., date of birth,etc.), controlling the amount or specificity of data stored (e.g.,collecting location data a city level rather than at an address level),controlling how data is stored (e.g., aggregating data across users),and/or other methods.

Therefore, although the present disclosure broadly covers use ofpersonal information data to implement one or more various disclosedembodiments, the present disclosure also contemplates that the variousembodiments can also be implemented without the need for accessing suchpersonal information data. That is, the various embodiments of thepresent technology are not rendered inoperable due to the lack of all ora portion of such personal information data.

What is claimed is:
 1. An electronic device, comprising: a camera; adisplay apparatus; one or more input devices; one or more processors;and memory storing one or more programs configured to be executed by theone or more processors, the one or more programs including instructionsfor: displaying, via the display apparatus, a messaging user interfaceof a message conversation including at least a first participant, themessaging user interface including a camera affordance; detecting, viathe one or more input devices, a first input directed to the cameraaffordance; in response to detecting the first input, displaying acamera user interface, the camera user interface including a captureaffordance, wherein the camera user interface further includes aneffects mode affordance associated with a mode in which visual effectsare enabled for display in image data that is captured; detecting, viathe one or more input devices, a selection of the effects modeaffordance; in response to detecting the selection of the effects modeaffordance, transitioning the electronic device from a first camera modeto a second camera mode, different from the first camera mode;detecting, via the one or more input devices, a second input directed tothe capture affordance; in response to detecting the second input:capturing image data using the camera; ceasing to display the captureaffordance; and displaying a send affordance; detecting, via the one ormore input devices, a third input directed to the send affordance; andin response to detecting the third input, initiating a process to sendthe captured image data to the first participant.
 2. The electronicdevice of claim 1, the one or more programs further includinginstructions for: further in response to detecting selection of theeffects mode affordance: ceasing to display one or more camera modeaffordances; and displaying a plurality of effects option affordances ata location in the camera user interface that was previously occupied bythe one or more camera mode affordances.
 3. The electronic device ofclaim 2, the one or more programs further including instructions for:detecting, via the one or more input devices, selection of a first oneof the effects option affordances; and in response to detectingselection of the first one of the effects option affordances: ceasing todisplay the plurality of effects option affordances; and displaying aplurality of selectable graphical icons.
 4. The electronic device ofclaim 2, the one or more programs further including instructions for:detecting, via the one or more input devices, selection of a second oneof the effects option affordances; and in response to detectingselection of the second one of the effects option affordances: ceasingto display the plurality of effects option affordances; and displayingan avatar selection region having a plurality of avatar affordances. 5.The electronic device of claim 4, the one or more programs furtherincluding instructions for: detecting, via the one or more inputdevices, a swipe input on the avatar selection region; and in responseto detecting the swipe input on the avatar selection region: increasinga size of the avatar selection region and displaying the plurality ofavatar affordances arranged in a matrix.
 6. The electronic device ofclaim 1, wherein the camera user interface further includes a firstrepresentation of image data, the one or more programs further includinginstructions for: further in response to detecting selection of theeffects mode affordance: in accordance with a determination that thefirst representation of image data corresponds to image data obtainedfrom a second camera: ceasing to display the first representation ofimage data; and displaying a second representation of image data, thesecond representation of image data corresponding to image data obtainedfrom the camera.
 7. The electronic device of claim 1, the one or moreprograms further including instructions for: while the electronic deviceis in the second camera mode, receiving a request to transition to thefirst camera mode; and in response to receiving the request totransition to the first camera mode: transitioning the electronic devicefrom the second camera mode to the first camera mode, and in accordancewith a first visual effect being active, deactivating the first visualeffect.
 8. The electronic device of claim 7, the one or more programsfurther including instructions for: after deactivating the first visualeffect: detecting, via the one or more input devices, subsequentselection of the effects mode affordance; and in response to detectingthe subsequent selection of the effects mode affordance: in accordancewith a determination that the subsequent selection of the effects modeaffordance occurs within a predetermined amount of time afterdeactivating the first visual effect, re-activating the first visualeffect.
 9. The electronic device of claim 1, wherein prior to detectingthe third input, the electronic device is in a first image capture mode,the one or more programs further including instructions for: further inresponse to detecting the third input, maintaining the first imagecapture mode.
 10. The electronic device of claim 1, wherein initiatingthe process to send the captured image data to the first participantincludes sending the captured image data to the first participant. 11.The electronic device of claim 1, wherein capturing the image dataincludes: in accordance with a value of a characteristic of the secondinput meeting a first capture mode criteria, capturing the image data ina first image capture mode; and in accordance with the value of thecharacteristic of the second input meeting a second capture modecriteria, capturing the image data in a second image capture mode. 12.The electronic device of claim 1, wherein the send affordance isdisplayed in a region in the camera user interface in which the captureaffordance was previously displayed.
 13. A non-transitorycomputer-readable storage medium storing one or more programs configuredto be executed by one or more processors of an electronic device with acamera, a display apparatus, and one or more input devices, the one ormore programs including instructions for: displaying, via the displayapparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance, wherein the camerauser interface further includes an effects mode affordance associatedwith a mode in which visual effects are enabled for display in imagedata that is captured; detecting, via the one or more input devices, aselection of the effects mode affordance; in response to detecting theselection of the effects mode affordance, transitioning the electronicdevice from a first camera mode to a second camera mode, different fromthe first camera mode; detecting, via the one or more input devices, asecond input directed to the capture affordance; in response todetecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; and in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant.
 14. The non-transitory computer-readable storage medium ofclaim 13, the one or more programs further including instructions for:while displaying the send affordance, displaying a representation of thefirst participant.
 15. The non-transitory computer-readable storagemedium of claim 14, wherein the representation of the first participantis not displayed prior to capturing the image data using the camera. 16.The non-transitory computer-readable storage medium of claim 13, the oneor more programs further including instructions for: further in responseto detecting selection of the effects mode affordance: ceasing todisplay one or more camera mode affordances; and displaying a pluralityof effects option affordances at a location in the camera user interfacethat was previously occupied by the one or more camera mode affordances.17. The non-transitory computer-readable storage medium of claim 16, theone or more programs further including instructions for: detecting, viathe one or more input devices, selection of a first one of the effectsoption affordances; and in response to detecting selection of the firstone of the effects option affordances: ceasing to display the pluralityof effects option affordances; and displaying a plurality of selectablegraphical icons.
 18. The non-transitory computer-readable storage mediumof claim 16, the one or more programs further including instructionsfor: detecting, via the one or more input devices, selection of a secondone of the effects option affordances; and in response to detectingselection of the second one of the effects option affordances: ceasingto display the plurality of effects option affordances; and displayingan avatar selection region having a plurality of avatar affordances. 19.The non-transitory computer-readable storage medium of claim 18, the oneor more programs further including instructions for: detecting, via theone or more input devices, a swipe input on the avatar selection region;and in response to detecting the swipe input on the avatar selectionregion: increasing a size of the avatar selection region and displayingthe plurality of avatar affordances arranged in a matrix.
 20. Thenon-transitory computer-readable storage medium of claim 13, wherein thecamera user interface further includes a first representation of imagedata, the one or more programs further including instructions for:further in response to detecting selection of the effects modeaffordance: in accordance with a determination that the firstrepresentation of image data corresponds to image data obtained from asecond camera: ceasing to display the first representation of imagedata; and displaying a second representation of image data, the secondrepresentation of image data corresponding to image data obtained fromthe camera.
 21. The non-transitory computer-readable storage medium ofclaim 13, the one or more programs further including instructions for:while the electronic device is in the second camera mode, receiving arequest to transition to the first camera mode; and in response toreceiving the request to transition to the first camera mode:transitioning the electronic device from the second camera mode to thefirst camera mode, and in accordance with a first visual effect beingactive, deactivating the first visual effect.
 22. The non-transitorycomputer-readable storage medium of claim 21, the one or more programsfurther including instructions for: after deactivating the first visualeffect: detecting, via the one or more input devices, subsequentselection of the effects mode affordance; and in response to detectingthe subsequent selection of the effects mode affordance: in accordancewith a determination that the subsequent selection of the effects modeaffordance occurs within a predetermined amount of time afterdeactivating the first visual effect, re-activating the first visualeffect.
 23. The non-transitory computer-readable storage medium of claim13, wherein prior to detecting the third input, the electronic device isin a first image capture mode, the one or more programs furtherincluding instructions for: further in response to detecting the thirdinput, maintaining the first image capture mode.
 24. The non-transitorycomputer-readable storage medium of claim 13, wherein initiating theprocess to send the captured image data to the first participantincludes sending the captured image data to the first participant. 25.The non-transitory computer-readable storage medium of claim 13, whereincapturing the image data includes: in accordance with a value of acharacteristic of the second input meeting a first capture modecriteria, capturing the image data in a first image capture mode; and inaccordance with the value of the characteristic of the second inputmeeting a second capture mode criteria, capturing the image data in asecond image capture mode.
 26. The non-transitory computer-readablestorage medium of claim 13, wherein the send affordance is displayed ina region in the camera user interface in which the capture affordancewas previously displayed.
 27. A method, comprising: at an electronicdevice having a camera, a display apparatus, and one or more inputdevices: displaying, via the display apparatus, a messaging userinterface of a message conversation including at least a firstparticipant, the messaging user interface including a camera affordance;detecting, via the one or more input devices, a first input directed tothe camera affordance; in response to detecting the first input,displaying a camera user interface, the camera user interface includinga capture affordance, wherein the camera user interface further includesan effects mode affordance associated with a mode in which visualeffects are enabled for display in image data that is captured;detecting, via the one or more input devices, a selection of the effectsmode affordance; in response to detecting the selection of the effectsmode affordance, transitioning the electronic device from a first cameramode to a second camera mode, different from the first camera mode;detecting, via the one or more input devices, a second input directed tothe capture affordance; in response to detecting the second input:capturing image data using the camera; ceasing to display the captureaffordance; and displaying a send affordance; detecting, via the one ormore input devices, a third input directed to the send affordance; andin response to detecting the third input, initiating a process to sendthe captured image data to the first participant.
 28. The method ofclaim 27, the method further comprising: while displaying the sendaffordance, displaying a representation of the first participant. 29.The method of claim 28, wherein the representation of the firstparticipant is not displayed prior to capturing the image data using thecamera.
 30. The method of claim 27, the method further comprising:further in response to detecting selection of the effects modeaffordance: ceasing to display one or more camera mode affordances; anddisplaying a plurality of effects option affordances at a location inthe camera user interface that was previously occupied by the one ormore camera mode affordances.
 31. The method of claim 30, the methodfurther comprising: detecting, via the one or more input devices,selection of a first one of the effects option affordances; and inresponse to detecting selection of the first one of the effects optionaffordances: ceasing to display the plurality of effects optionaffordances; and displaying a plurality of selectable graphical icons.32. The method of claim 30, the method further comprising: detecting,via the one or more input devices, selection of a second one of theeffects option affordances; and in response to detecting selection ofthe second one of the effects option affordances: ceasing to display theplurality of effects option affordances; and displaying an avatarselection region having a plurality of avatar affordances.
 33. Themethod of claim 32, the method further comprising: detecting, via theone or more input devices, a swipe input on the avatar selection region;and in response to detecting the swipe input on the avatar selectionregion: increasing a size of the avatar selection region and displayingthe plurality of avatar affordances arranged in a matrix.
 34. The methodof claim 27, wherein the camera user interface further includes a firstrepresentation of image data, the method further comprising: further inresponse to detecting selection of the effects mode affordance: inaccordance with a determination that the first representation of imagedata corresponds to image data obtained from a second camera: ceasing todisplay the first representation of image data; and displaying a secondrepresentation of image data, the second representation of image datacorresponding to image data obtained from the camera.
 35. The method ofclaim 27, the method further comprising: while the electronic device isin the second camera mode, receiving a request to transition to thefirst camera mode; and in response to receiving the request totransition to the first camera mode: transitioning the electronic devicefrom the second camera mode to the first camera mode, and in accordancewith a first visual effect being active, deactivating the first visualeffect.
 36. The method of claim 35, the method further comprising: afterdeactivating the first visual effect: detecting, via the one or moreinput devices, subsequent selection of the effects mode affordance; andin response to detecting the subsequent selection of the effects modeaffordance: in accordance with a determination that the subsequentselection of the effects mode affordance occurs within a predeterminedamount of time after deactivating the first visual effect, re-activatingthe first visual effect.
 37. The method of claim 27, wherein prior todetecting the third input, the electronic device is in a first imagecapture mode, the method further comprising: further in response todetecting the third input, maintaining the first image capture mode. 38.The method of claim 27, wherein initiating the process to send thecaptured image data to the first participant includes sending thecaptured image data to the first participant.
 39. The method of claim27, wherein capturing the image data includes: in accordance with avalue of a characteristic of the second input meeting a first capturemode criteria, capturing the image data in a first image capture mode;and in accordance with the value of the characteristic of the secondinput meeting a second capture mode criteria, capturing the image datain a second image capture mode.
 40. The method of claim 27, wherein thesend affordance is displayed in a region in the camera user interface inwhich the capture affordance was previously displayed.
 41. An electronicdevice, comprising: a camera; a display apparatus; one or more inputdevices; one or more processors; and memory storing one or more programsconfigured to be executed by the one or more processors, the one or moreprograms including instructions for: displaying, via the displayapparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance; detecting, via theone or more input devices, a second input directed to the captureaffordance; in response to detecting the second input: capturing imagedata using the camera; ceasing to display the capture affordance; anddisplaying a send affordance; detecting, via the one or more inputdevices, a third input directed to the send affordance; in response todetecting the third input, initiating a process to send the capturedimage data to the first participant, wherein initiating the process tosend the captured image data to the first participant includesre-displaying the messaging user interface, the messaging user interfacefurther including a keyboard region and an application menu affordance;detecting, via the one or more input devices, selection of theapplication menu affordance; and in response to detecting selection ofthe application menu affordance, displaying an application menu regionadjacent the keyboard region, the application menu region having aplurality of application affordances.
 42. The electronic device of claim41, the one or more programs further including instructions for:detecting a fourth input on the messaging user interface; and inaccordance with a determination the fourth input corresponds to alocation of the keyboard region or a location of a message-composeregion in the messaging user interface: ceasing to display theapplication menu region; and displaying a text-suggestion region at alocation in the messaging user interface that was previously occupied bythe application menu region.
 43. The electronic device of claim 41, theone or more programs further including instructions for: detectingselection of one of the plurality of application affordances in theapplication menu region; and in response to detecting selection of theapplication affordance: ceasing to display the keyboard region; anddisplaying an application display region at a location in the messaginguser interface that was previously occupied by the keyboard region, theapplication display region including a plurality of graphical objectscorresponding to the selected application affordance.
 44. Anon-transitory computer-readable storage medium storing one or moreprograms configured to be executed by one or more processors of anelectronic device with a camera, a display apparatus, and one or moreinput devices, the one or more programs including instructions for:displaying, via the display apparatus, a messaging user interface of amessage conversation including at least a first participant, themessaging user interface including a camera affordance; detecting, viathe one or more input devices, a first input directed to the cameraaffordance; in response to detecting the first input, displaying acamera user interface, the camera user interface including a captureaffordance; detecting, via the one or more input devices, a second inputdirected to the capture affordance; in response to detecting the secondinput: capturing image data using the camera; ceasing to display thecapture affordance; and displaying a send affordance; detecting, via theone or more input devices, a third input directed to the sendaffordance; in response to detecting the third input, initiating aprocess to send the captured image data to the first participant,wherein initiating the process to send the captured image data to thefirst participant includes re-displaying the messaging user interface,the messaging user interface further including a keyboard region and anapplication menu affordance; detecting, via the one or more inputdevices, selection of the application menu affordance; and in responseto detecting selection of the application menu affordance, displaying anapplication menu region adjacent the keyboard region, the applicationmenu region having a plurality of application affordances.
 45. Thenon-transitory computer-readable storage medium of claim 44, the one ormore programs further including instructions for: detecting a fourthinput on the messaging user interface; and in accordance with adetermination the fourth input corresponds to a location of the keyboardregion or a location of a message-compose region in the messaging userinterface: ceasing to display the application menu region; anddisplaying a text-suggestion region at a location in the messaging userinterface that was previously occupied by the application menu region.46. The non-transitory computer-readable storage medium of claim 44, theone or more programs further including instructions for: detectingselection of one of the plurality of application affordances in theapplication menu region; and in response to detecting selection of theapplication affordance: ceasing to display the keyboard region; anddisplaying an application display region at a location in the messaginguser interface that was previously occupied by the keyboard region, theapplication display region including a plurality of graphical objectscorresponding to the selected application affordance.
 47. A method,comprising: at an electronic device having a camera, a displayapparatus, and one or more input devices: displaying, via the displayapparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance; detecting, via theone or more input devices, a second input directed to the captureaffordance; in response to detecting the second input: capturing imagedata using the camera; ceasing to display the capture affordance; anddisplaying a send affordance; detecting, via the one or more inputdevices, a third input directed to the send affordance; in response todetecting the third input, initiating a process to send the capturedimage data to the first participant, wherein initiating the process tosend the captured image data to the first participant includesre-displaying the messaging user interface, the messaging user interfacefurther including a keyboard region and an application menu affordance;detecting, via the one or more input devices, selection of theapplication menu affordance; and in response to detecting selection ofthe application menu affordance, displaying an application menu regionadjacent the keyboard region, the application menu region having aplurality of application affordances.
 48. The method of claim 47, methodfurther comprising: detecting a fourth input on the messaging userinterface; and in accordance with a determination the fourth inputcorresponds to a location of the keyboard region or a location of amessage-compose region in the messaging user interface: ceasing todisplay the application menu region; and displaying a text-suggestionregion at a location in the messaging user interface that was previouslyoccupied by the application menu region.
 49. The method of claim 47, themethod further comprising: detecting selection of one of the pluralityof application affordances in the application menu region; and inresponse to detecting selection of the application affordance: ceasingto display the keyboard region; and displaying an application displayregion at a location in the messaging user interface that was previouslyoccupied by the keyboard region, the application display regionincluding a plurality of graphical objects corresponding to the selectedapplication affordance.
 50. An electronic device, comprising: a camera;a display apparatus; one or more input devices; one or more processors;and memory storing one or more programs configured to be executed by theone or more processors, the one or more programs including instructionsfor: displaying, via the display apparatus, a messaging user interfaceof a message conversation including at least a first participant, themessaging user interface including a camera affordance; detecting, viathe one or more input devices, a first input directed to the cameraaffordance; in response to detecting the first input, displaying acamera user interface, the camera user interface including a captureaffordance; detecting, via the one or more input devices, a second inputdirected to the capture affordance; in response to detecting the secondinput: capturing image data using the camera; ceasing to display thecapture affordance; and displaying a send affordance; detecting, via theone or more input devices, a third input directed to the sendaffordance; in response to detecting the third input, initiating aprocess to send the captured image data to the first participant; andprior to detecting the third input directed to the send affordance:displaying a done affordance; detecting, via the one or more inputdevices, selection of the done affordance; and in response to detectingselection of the done affordance: displaying the messaging userinterface, the messaging user interface having a message-compose region;and displaying a representation of the captured image data in themessage-compose region.
 51. A non-transitory computer-readable storagemedium storing one or more programs configured to be executed by one ormore processors of an electronic device with a camera, a displayapparatus, and one or more input devices, the one or more programsincluding instructions for: displaying, via the display apparatus, amessaging user interface of a message conversation including at least afirst participant, the messaging user interface including a cameraaffordance; detecting, via the one or more input devices, a first inputdirected to the camera affordance; in response to detecting the firstinput, displaying a camera user interface, the camera user interfaceincluding a capture affordance; detecting, via the one or more inputdevices, a second input directed to the capture affordance; in responseto detecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; and prior to detecting the third input directed to the sendaffordance: displaying a done affordance; detecting, via the one or moreinput devices, selection of the done affordance; and in response todetecting selection of the done affordance: displaying the messaginguser interface, the messaging user interface having a message-composeregion; and displaying a representation of the captured image data inthe message-compose region.
 52. A method, comprising: at an electronicdevice having a camera, a display apparatus, and one or more inputdevices: displaying, via the display apparatus, a messaging userinterface of a message conversation including at least a firstparticipant, the messaging user interface including a camera affordance;detecting, via the one or more input devices, a first input directed tothe camera affordance; in response to detecting the first input,displaying a camera user interface, the camera user interface includinga capture affordance; detecting, via the one or more input devices, asecond input directed to the capture affordance; in response todetecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; and prior to detecting the third input directed to the sendaffordance: displaying a done affordance; detecting, via the one or moreinput devices, selection of the done affordance; and in response todetecting selection of the done affordance: displaying the messaginguser interface, the messaging user interface having a message-composeregion; and displaying a representation of the captured image data inthe message-compose region.
 53. An electronic device, comprising: acamera; a display apparatus; one or more input devices; one or moreprocessors; and memory storing one or more programs configured to beexecuted by the one or more processors, the one or more programsincluding instructions for: displaying, via the display apparatus, amessaging user interface of a message conversation including at least afirst participant, the messaging user interface including a cameraaffordance; detecting, via the one or more input devices, a first inputdirected to the camera affordance; in response to detecting the firstinput, displaying a camera user interface, the camera user interfaceincluding a capture affordance; detecting, via the one or more inputdevices, a second input directed to the capture affordance; in responseto detecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; after capturing image data using the camera, displaying amark-up affordance, an edit affordance, and a retake affordance; whiledisplaying the mark-up affordance, the edit affordance, and the retakeaffordance, receiving, via the one or more input devices, a fifth input;and in response to detecting the fifth input: in accordance with thefifth input corresponding to the edit affordance, initiating a processfor editing the captured image data; in accordance with the fifth inputcorresponding to the mark-up affordance, initiating a process formarking-up the captured image data; and in accordance with the fifthinput corresponding to the retake affordance, initiating a process forretaking the captured image data.
 54. A non-transitory computer-readablestorage medium storing one or more programs configured to be executed byone or more processors of an electronic device with a camera, a displayapparatus, and one or more input devices, the one or more programsincluding instructions for: displaying, via the display apparatus, amessaging user interface of a message conversation including at least afirst participant, the messaging user interface including a cameraaffordance; detecting, via the one or more input devices, a first inputdirected to the camera affordance; in response to detecting the firstinput, displaying a camera user interface, the camera user interfaceincluding a capture affordance; detecting, via the one or more inputdevices, a second input directed to the capture affordance; in responseto detecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; after capturing image data using the camera, displaying amark-up affordance, an edit affordance, and a retake affordance; whiledisplaying the mark-up affordance, the edit affordance, and the retakeaffordance, receiving, via the one or more input devices, a fifth input;and in response to detecting the fifth input: in accordance with thefifth input corresponding to the edit affordance, initiating a processfor editing the captured image data; in accordance with the fifth inputcorresponding to the mark-up affordance, initiating a process formarking-up the captured image data; and in accordance with the fifthinput corresponding to the retake affordance, initiating a process forretaking the captured image data.
 55. A method, comprising: at anelectronic device having a camera, a display apparatus, and one or moreinput devices: displaying, via the display apparatus, a messaging userinterface of a message conversation including at least a firstparticipant, the messaging user interface including a camera affordance;detecting, via the one or more input devices, a first input directed tothe camera affordance; in response to detecting the first input,displaying a camera user interface, the camera user interface includinga capture affordance; detecting, via the one or more input devices, asecond input directed to the capture affordance; in response todetecting the second input: capturing image data using the camera;ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; after capturing image data using the camera, displaying amark-up affordance, an edit affordance, and a retake affordance; whiledisplaying the mark-up affordance, the edit affordance, and the retakeaffordance, receiving, via the one or more input devices, a fifth input;and in response to detecting the fifth input: in accordance with thefifth input corresponding to the edit affordance, initiating a processfor editing the captured image data; in accordance with the fifth inputcorresponding to the mark-up affordance, initiating a process formarking-up the captured image data; and in accordance with the fifthinput corresponding to the retake affordance, initiating a process forretaking the captured image data.
 56. An electronic device, comprising:a camera; a display apparatus; one or more input devices; one or moreprocessors; and memory storing one or more programs configured to beexecuted by the one or more processors, the one or more programsincluding instructions for: displaying, via the display apparatus, amessaging user interface of a message conversation including at least afirst participant, the messaging user interface including a cameraaffordance; detecting, via the one or more input devices, a first inputdirected to the camera affordance; in response to detecting the firstinput, displaying a camera user interface, the camera user interfaceincluding a capture affordance, wherein the camera user interface, whiledisplaying the capture affordance, further includes a camera displayregion including a representation of a live preview of a field of viewof the camera; detecting, via the one or more input devices, a secondinput directed to the capture affordance; in response to detecting thesecond input: capturing image data using the camera; ceasing to displaythe capture affordance; and displaying a send affordance; detecting, viathe one or more input devices, a third input directed to the sendaffordance; in response to detecting the third input, initiating aprocess to send the captured image data to the first participant; whilea subject is positioned within the field of view of the camera and arepresentation of the subject and a background are displayed in thecamera display region, displaying a representation of a selected avataron the representation of the subject in the camera display region; whiledisplaying the representation of the selected avatar on therepresentation of the subject in the camera display region, receiving arequest to display an avatar selection region; in response to receivingthe request to display an avatar selection region: ceasing to displaythe capture affordance; and displaying an avatar selection region havinga plurality of avatar affordances.
 57. A non-transitorycomputer-readable storage medium storing one or more programs configuredto be executed by one or more processors of an electronic device with acamera, a display apparatus, and one or more input devices, the one ormore programs including instructions for: displaying, via the displayapparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance, wherein the camerauser interface, while displaying the capture affordance, furtherincludes a camera display region including a representation of a livepreview of a field of view of the camera; detecting, via the one or moreinput devices, a second input directed to the capture affordance; inresponse to detecting the second input: capturing image data using thecamera; ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; while a subject is positioned within the field of view ofthe camera and a representation of the subject and a background aredisplayed in the camera display region, displaying a representation of aselected avatar on the representation of the subject in the cameradisplay region; while displaying the representation of the selectedavatar on the representation of the subject in the camera displayregion, receiving a request to display an avatar selection region; inresponse to receiving the request to display an avatar selection region:ceasing to display the capture affordance; and displaying an avatarselection region having a plurality of avatar affordances.
 58. A method,comprising: at an electronic device having a camera, a displayapparatus, and one or more input devices: displaying, via the displayapparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance, wherein the camerauser interface, while displaying the capture affordance, furtherincludes a camera display region including a representation of a livepreview of a field of view of the camera; detecting, via the one or moreinput devices, a second input directed to the capture affordance; inresponse to detecting the second input: capturing image data using thecamera; ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; while a subject is positioned within the field of view ofthe camera and a representation of the subject and a background aredisplayed in the camera display region, displaying a representation of aselected avatar on the representation of the subject in the cameradisplay region; while displaying the representation of the selectedavatar on the representation of the subject in the camera displayregion, receiving a request to display an avatar selection region; inresponse to receiving the request to display an avatar selection region:ceasing to display the capture affordance; and displaying an avatarselection region having a plurality of avatar affordances.
 59. Anelectronic device, comprising: a camera; a display apparatus; one ormore input devices; one or more processors; and memory storing one ormore programs configured to be executed by the one or more processors,the one or more programs including instructions for: displaying, via thedisplay apparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance, wherein the camerauser interface, while displaying the capture affordance, furtherincludes a camera display region including a representation of a livepreview of a field of view of the camera; detecting, via the one or moreinput devices, a second input directed to the capture affordance; inresponse to detecting the second input: capturing image data using thecamera; ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; while a subject is positioned within the field of view ofthe camera and a representation of the subject and a background aredisplayed in the camera display region, displaying a representation of aselected avatar on the representation of the subject in the cameradisplay region; while displaying the representation of the selectedavatar on the representation of the subject in the camera displayregion, detecting a change in pose of the subject; and in response todetecting the change in pose of the subject, changing an appearance ofthe displayed representation of the selected avatar based on thedetected change in pose of the subject while maintaining display of thebackground.
 60. A non-transitory computer-readable storage mediumstoring one or more programs configured to be executed by one or moreprocessors of an electronic device with a camera, a display apparatus,and one or more input devices, the one or more programs includinginstructions for: displaying, via the display apparatus, a messaginguser interface of a message conversation including at least a firstparticipant, the messaging user interface including a camera affordance;detecting, via the one or more input devices, a first input directed tothe camera affordance; in response to detecting the first input,displaying a camera user interface, the camera user interface includinga capture affordance, wherein the camera user interface, whiledisplaying the capture affordance, further includes a camera displayregion including a representation of a live preview of a field of viewof the camera; detecting, via the one or more input devices, a secondinput directed to the capture affordance; in response to detecting thesecond input: capturing image data using the camera; ceasing to displaythe capture affordance; and displaying a send affordance; detecting, viathe one or more input devices, a third input directed to the sendaffordance; in response to detecting the third input, initiating aprocess to send the captured image data to the first participant; whilea subject is positioned within the field of view of the camera and arepresentation of the subject and a background are displayed in thecamera display region, displaying a representation of a selected avataron the representation of the subject in the camera display region; whiledisplaying the representation of the selected avatar on therepresentation of the subject in the camera display region, detecting achange in pose of the subject; and in response to detecting the changein pose of the subject, changing an appearance of the displayedrepresentation of the selected avatar based on the detected change inpose of the subject while maintaining display of the background.
 61. Amethod, comprising: at an electronic device having a camera, a displayapparatus, and one or more input devices: displaying, via the displayapparatus, a messaging user interface of a message conversationincluding at least a first participant, the messaging user interfaceincluding a camera affordance; detecting, via the one or more inputdevices, a first input directed to the camera affordance; in response todetecting the first input, displaying a camera user interface, thecamera user interface including a capture affordance, wherein the camerauser interface, while displaying the capture affordance, furtherincludes a camera display region including a representation of a livepreview of a field of view of the camera; detecting, via the one or moreinput devices, a second input directed to the capture affordance; inresponse to detecting the second input: capturing image data using thecamera; ceasing to display the capture affordance; and displaying a sendaffordance; detecting, via the one or more input devices, a third inputdirected to the send affordance; in response to detecting the thirdinput, initiating a process to send the captured image data to the firstparticipant; while a subject is positioned within the field of view ofthe camera and a representation of the subject and a background aredisplayed in the camera display region, displaying a representation of aselected avatar on the representation of the subject in the cameradisplay region; while displaying the representation of the selectedavatar on the representation of the subject in the camera displayregion, detecting a change in pose of the subject; and in response todetecting the change in pose of the subject, changing an appearance ofthe displayed representation of the selected avatar based on thedetected change in pose of the subject while maintaining display of thebackground.